Compositions for the detection of enzyme activity in biological samples and methods of use thereof

ABSTRACT

The present invention provides for novel reagents whose fluorescence increases in the presence of particular proteases. The reagents comprise a characteristically folded peptide backbone each end of which is conjugated to a fluorophore. When the folded peptide is cleaved, as by digestion with a protease, the fluorophores provide a high intensity fluorescent signal at a visible wavelength. Because of their high fluorescence signal in the visible wavelengths, these protease indicators are particularly well suited for detection of protease activity in biological samples, in particular in frozen tissue sections. Thus this invention also provides for methods of detecting protease activity in situ in frozen sections.

This is a continuation-in-part of PCT/US98/0300, filed on Feb. 20, 1998, designating the United States, which is a continuation-in-part of U.S. application Ser. No. 08/802,981, filed on Feb. 20, 1997, now U.S. Pat. No. 6,037,137, all of which are herein incorporated by reference in their entirety for all purposes.

FIELD OF THE INVENTION

This invention pertains to a class of novel fluorogenic compositions whose fluorescence level increases in the presence active proteases. These fluorogenic protease indicators typically fluoresce at visible wavelengths and are thus highly useful for the detection and localization of protease activity in biological samples.

BACKGROUND OF THE INVENTION

Proteases represent a number of families of proteolytic enzymes that catalytically hydrolyze peptide bonds. Principal groups of proteases include metalloproteases, serine porteases, cysteine proteases and aspartic proteases. Proteases, in particular serine proteases, are involved in a number of physiological processes such as blood coagulation, fertilization, inflammation, hormone production, the immune response and fibrinolysis.

Numerous disease states are caused by and can be characterized by alterations in the activity of specific proteases and their inhibitors. For example emphysema, arthritis, thrombosis, cancer metastasis and some forms of hemophilia result from the lack of regulation of serine protease activities (see, for example, Textbook of Biochemistry with Clinical Correlations, John Wiley and Sons, Inc. N.Y. (1993)). In case of viral infection, the presence of viral proteases have been identified in infected cells. Such viral proteases include, for example, HIV protease associated with AIDS and NS3 protease associated with Hepatitis C. These viral proteases play a critical role in the virus life cycle.

Proteases have also been implicated in cancer metastasis. Increased synthesis of the protease urokinase has been correlated with an increased ability to metastasize in many cancers. Urokinase activates plasmin from plasminogen which is ubiquitously located in the extracellular space and its activation can cause the degradation of the proteins in the extracellular matrix through which the metastasizing tumor cells invade. Plasmin can also activate the collagenases thus promoting the degradation of the collagen in the basement membrane surrounding the capillaries and lymph system thereby allowing tumor cells to invade into the target tissues (Dano, et al. (1985) Adv. Cancer. Res., 44: 139.

Clearly measurement of changes in the activity of specific proteases is clinically significant in the treatment and management of the underlying disease states. Proteases, however, are not easy to assay. Typical approaches include ELISA using antibodies that bind the protease or RIA using various labeled substrates. With their natural substrates assays are difficult to perform and expensive. With currently available synthetic substrates the assays are expensive, insensitive and nonselective. In addition, many “indicator” substrates require high quantities of protease which results, in part, in the self destruction of the protease.

Recent approaches to protease detection rely on a cleavage-induced spectroscopic change in a departing chromogen or fluorogen located in the P1′ position (the amino acid position on the carboxyl side of the cleavable peptide bond) (see, for example U.S. Pat. Nos. 4,557,862 and 4,648,893). However, many proteases require two or four amino acid residues on either side of the scissile bond for recognition of the protease (a specific protease may require up to 6 amino acid residues) and thus, these approaches lack protease specificity.

Recently however, fluorogenic indicator compositions have been developed in which a “donor” fluorophore is joined to an “acceptor” chromophorc by a short bridge containing a (7 amino acid) peptide that is the binding site for an HIV protease and linkers joining the fluorophore and chromophore to the peptide (Wang et al. (1990) Tetra. Letts. 45: 6493-6496). The signal of the donor fluorophore was quenched by the acceptor chromophore through a process believed to involve resonance energy transfer (RET). Cleavage of the peptide resulted in separation of the chromophore and fluorophore, removal of the quench and a subsequent signal was measured from the donor fluorophore.

Unfortunately, the design of the bridge between the donor and the acceptor led to relatively inefficient quenching limiting the sensitivity of the assay. In addition, the chromophore absorbed light in the ultraviolet range reducing the sensitivity for detection in biological samples which typically contain molecules that absorb strongly in the ultraviolet.

Clearly fluorogenic protease indicators that show a high signal level when cleaved, and a very low signal level when intact, that show a high degree of protease specificity, and that operate exclusively in the visible range thereby rendering them suitable for use in biological samples are desirable. The compositions of the present invention provide these and other benefits.

SUMMARY OF THE INVENTION

The present invention provides for novel reagents whose fluorescence increases in the presence of particular proteases. These fluorogenic protease indicators provide a high intensity fluorescent signal at a visible wavelength when they are digested by a protease. Because of their high fluorescence signal in the visible wavelengths, these protease indicators are particularly well suited for detection of protease activity in biological samples, in particular, in frozen tissue sections. The measurement can be carried out using a fluorescence microscope for histological samples and using a flow cytometer for cell suspension samples. Hence, the fluorogenic compositions of this invention allow detection of intracellular protease activity.

The fluorogenic protease indicators of the present invention are compositions suitable for detection of the activity of a protease. These compositions have the general formula:

in which P is a peptide comprising a protease binding site for said protease consisting of 2 to about 15, preferably 2 to about 12, preferably 2 to about 10, preferably 2 to about 8, 2 to about 6, or 2 to about 4 amino acids; F¹ and F² are fluorophores; S¹ and S² are peptide spacers ranging in length from 1 to about 50 amino acids; n and k are independently 0 or 1; and C¹ and C² are conformation determining regions comprising peptides ranging in length from 1 to about 8, amino acids, more preferably from 1 to about 6 amino acids. The conformation determining regions each introduce a bend into the composition or otherwise restrict the degrees of freedom of the peptide backbone, thereby juxtaposing the fluorophores with a separation of less than about 100 Å. When either of the spacers (S¹ and S²) are present they are linked to the protease binding site by a peptide bond to the alpha carbon of the terminal amino acid. Thus, when i is 1, S¹ is joined to C¹ by a peptide bond through a terminal α-amino group of C¹, and when r is 1, S² is joined to C² by a peptide bond through a terminal alpha carboxyl group of C².

The amino acid residues comprising a protease binding site are, by convention, numbered relative to the peptide bond hydrolyzed by a particular protease. Thus the first amino acid residue on the amino side of the cleaved peptide bond is designated P₁ while the first amino acid residue on the carboxyl side of the cleaved peptide bond is designated P₁′. The numbering of the residues increases with distance away from the hydrolyzed peptide bond. Thus a four amino acid protease binding region would contain amino acids designated: P₂—P₁—P₁′—P₂′ and the protease would cleave the binding region between P₁ and P₁′.

In particularly preferred embodiments, the fluorogenic compositions of this invention are compositions of Formula II and Formula V as described herein. Preferred fluorophores have conformation determining regions and, optionally, spacers as described herein. In a most preferred embodiment, the compositions bear a single species of fluorophore. Fluorophores suitable for these “homolabeled” compositions include fluorophores that form H-type dimers. Particularly preferred fluorophores have an excitation wavelength between about 315 nm and about 700 nm.

In another embodiment, this invention provides methods of detecting the activity of a protease. The methods involve contacting the protease with one or more of the protease indicators described herein. In a particularly preferred embodiment, the “contacting” is in a histological section or in a cell suspension or culture derived from a biological sample selected from the group consisting of a tissue, blood, urine, saliva, or other biofluid, lymph, biopsy. The detection method can include a method selected from the group consisting of fluorescence microscopy, fluorescence microplate reader, flow cytometry, fluorometry, absorption spectroscopy.

In preferred compositions F¹ may be 5- and/or 6-carboxytetramethylrhodamine; and F² may be rhodamine X acetamide. These compositions may be conjugated to a solid support or to a lipid including membrane lipids or liposomes.

In another embodiment, any of the compositions described above may be used in a method for detecting protease activity in a sample. The sample may be a sample of “stock” protease, such as is used in research or industry, or it may be a biological sample. Thus, this invention provides for a method of detecting protease activity in a sample by contacting the sample with any of the compositions described above and then detecting a change in fluorescence of the fluorogenic composition where an increase in fluorescence indicates protease activity. The sample is preferably a biological sample which may include biological fluids such as sputum or blood, tissue samples such as biopsies or sections, and cell samples either as biopsies or in culture. Particularly preferred are tissue sections, cultured cells, cultured tissues, and the like.

In still yet another embodiment, this invention provides a method of delivering a molecule into a cell. The method involves providing the molecule attached to at least two fluorophore molecules and a hydrophobic group; and contacting the cell with the molecule whereby the molecule enters the cell. In one embodiment, the method involves providing the molecule attached to at least two largely flat hydrophobic fluorophore molecules and a hydrophobic group. Preferred molecules include a polypeptide, a nucleic acid, a lipid, an oligosaccharide. Suitable fluorophores and hydrophobic groups are described herein. Preferred cells include mammalian cells.

Definitions

The term “protease binding site” is used herein to refers to an amino acid sequence that is characteristically recognized and cleaved by a protease. The protease binding site contains a peptide bond that is hydrolyzed by the protease and the amino acid residues joined by this peptide bond are said to form the cleavage site. These amino acids are designated P₁ and P₁′ for the residues on the amino and carboxyl sides of the hydrolyzed bond respectively.

A fluorophore is a molecule that absorbs light at a characteristic wavelength and then re-emits the light most typically at a characteristic different wavelength. Fluorophores are well known to those of skill in the art and include, but are not limited to rhodamine and rhodamine derivatives, fluorescein and fluorescein derivatives, coumarins and chelators with the lanthanide ion series. A fluorophore is distinguished from a chromophore which absorbs, but does not characteristically re-emit light.

“Peptides” and “polypeptides” are chains of amino acids whose a carbons are linked through peptide bonds formed by a condensation reaction between the a carboxyl group of one amino acid and the amino group of another amino acid. The terminal amino acid at one end of the chain (amino terminal) therefore has a free amino group, while the terminal amino acid at the other end of the chain (carboxy terminal) has a free carboxyl group. As used herein, the term “amino terminus” (abbreviated N-terminus) refers to the free α-amino group on an amino acid at the amino terminal of a peptide or to the α-amino group (imino group when participating in a peptide bond) of an amino acid at any other location within the peptide. Similarly, the term “carboxy terminus” refers to the free carboxyl group on the carboxy terminus of a peptide or the carboxyl group of an amino acid at any other location within the peptide. Peptides also include peptide mimetics such as amino acids joined by an ether as opposed to an amide bond.

The polypeptides described herein are written with the amino terminus at the left and the carboxyl terminus at the right. The amino acids comprising the peptide components of this invention are numbered with respect to the protease cleavage site, with numbers increasing consecutively with distance in both the carboxyl and amino direction from the cleavage site. Residues on the carboxyl site are either notated with a “′” as in P₁′, or with a letter and superscript indicating the region in which they are located. The “′” indicates that residues are located on the carboxyl side of the cleavage site.

The term “residue” or “amino acid” as used herein refers to an amino acid that is incorporated into a peptide. The amino acid may be a naturally occurring amino acid and, unless otherwise limited, may encompass known analogs of natural amino acids that can function in a similar manner as naturally occurring amino acids.

The term “domain” or “region” refers to a characteristic region of a polypeptide. The domain may be characterized by a particular structural feature such as a β turn, an alpha helix, or a β pleated sheet, by characteristic constituent amino acids (e.g. predominantly hydrophobic or hydrophilic amino acids, or repeating amino acid sequences), or by its localization in a particular region of the folded three dimensional polypeptide. As used herein, a region or domain is composed of a series of contiguous amino acids.

The terms “protease activity” or “activity of a protease” refer to the cleavage of a peptide by a protease. Protease activity comprises the “digestion” of one or more peptides into a larger number of smaller peptide fragments. Protease activity of particular proteases may result in hydrolysis at particular peptide binding sites characteristically recognized by a particular protease. The particular protease may be characterized by the production of peptide fragments bearing particular terminal amino acid residues.

The amino acids referred to herein are described by shorthand designations as shown in Table 1.

TABLE 1 Amino acid nomenclature. Abbreviation Name 3 Letter 1 Letter Alanine Ala A βAlanine (NH₂—CH₂—CH₂—COOH) βAla Arginine Arg R Asparagine Asn N Aspartic Acid Asp D Cysteine Cys C Glutamic Acid Glu E Glutamine Gln Q Glycine Gly G Histidine His H Homoserine Hse — Isoleucine Ile I Leucine Leu L Lysine Lys K Methionine Met M Methionine sulfoxide Met (O) — Methionine methylsulfonium Met (S—Me) — Norleucine Nle — Phenylalanine Phe F Proline Pro P Serine Ser S Threonine Thr T Tryptophan Trp W Tyrosine Tyr Y Valine Val V episilon-aminocaproic acid Ahx J (NH²—(CH₂)₅—COOH) 4-aminobutanoic acid gAbu — (NH₂—(CH₂)₃—COOH) tetrahydroisoquinoline-3-carboxylic — O acid 8-aminocaprylic acid — C7 4-aminobutyric acid — C3 Lys(N(epsilon)-trifluoracetyl) — k(TFA) α-aminoisobutyric acid Aib B Other abbreviations used herein include “Fm” for Fmoc (9-fluorenylmethoxycarbonyl) group, “Ac” for N(alpha)-acetyl group, “daa” or (d-aa) where “d” indicates the d isomer of the aa, and “Z” for benzyloxycarbonyl group.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A, 1B, and 1C show an HPLC analysis of the D-NorFES-A protease indicator (F¹-Asp-Ala-Ile-Pro-Nle-Ser-Ile-Pro-Cys-F²) where F¹ is a donor (D) fluoropbore (5′-carboxytetramethylrhodamine (C2211) and F² is an acceptor (A) fluorophore (rhodamine X acetamide (R492))) before and after the addition of elastase. FIG. 1A: HPLC before the addition of elastase showing the late eluting peak representing the intact indicator molecule. FIG. 1B: HPLC after the addition of elastase with detection at 550 nm where both fluorophores absorb. FIG. 1C HPLC after the addition of elastase with detection at 580 nm where F² absorbs maximally.

FIGS. 2A and 2B show the emission spectra of the D-NorFES-A fluorogenic protease indicator (FIG. 2A) before and (FIG. 2B) after the addition of elastase.

FIG. 3 shows the time-dependent increase of the fluorogenic protease indicator of FIG. 1, as a function of time after addition of 1 unit of elastase.

FIGS. 4A and 4B show the fluorescence intensity of the donor fluorophore as a function of time after addition of 1 unit of elastase. FIG. 4A: The fluorogenic protease indicator of FIG. 1. FIG. 4B: The peptide backbone of the fluorogenic protease of FIG. 1 singly labeled with each of the two fluorophores. D-NorFES-A is the F¹-Asp-Ala-Ile-Pro-Nle-Ser-Ile-Pro-Cys-F² protease indicator where F¹ is a donor fluorophore (5′-carboxytetramethylrhodamine (C2211) and F² is an acceptor fluorophore (rhodamine X acetamide (R492). D-NorFES and A-NorFES each designate a molecule having the same peptide backbone, but bearing only one of the two fluorophores.

FIG. 5 illustrates fluorescence of a DEVD, a DEVN, and an ICE substrate. To one hundred μl of assay buffer (50mM HEPES buffer pH 7.5, 10% (w/v) sucrose and 0.1% (w/v) CHAPS ) containing 1 μM of substrate DEVD (compound 2 of Example 8), DEVN (compound 3 of Example 8) and ICE (compound 5 of Example 8) 10 μl of Jurkat cell lysate was added and incubated for 16 hours at 37 C. The Jurkat cells' lysate was prepared from the cells that had been stimulated by antiFas antibody at 1 μg/ml concentration for 6 hours. The fluorescence intensity for the substrate solution alone is indicated in FIG. 5 as a horizontal lined bar marked as t=0 hr and the fluorescence intensity of the lysate and substrate solution mixture after 16 hr is indicated by vertical line bar and is marked as t=16 hr digestion. 10 μl cell lysate was pre-incubated with 50 μm ZVAD-FMK (benzyoxycarbonyl valanyl alanyl aspartyl-fluoromethylketone) at 37 degree C. for 30 min. then added to the substrate solution. The fluorescence intensity after 16 hours for this mixture is indicated by the bar marked as ZVAD-FMK (inhibitor). Lastly, pre-incubated cell lysate with iodoacetamide(alkylating agent for sulfhydryl group) and PMSF (for inhibiting serine proteases) was added to the substrate solution. The fluorescence intensity after 16 hours at 37 C is indicated by bar marked as Iodoacetamide/PMSF. The DEVN substrate is a negative control substrate where the P1, Asp, residue is replaced by Asn. The CPP32 protease requires the P1 residue to be aspartic acid residue. The four bar graphs for the DEVN substrate (FIG. 5) clearly indicate that the activated cell lysate do not contain any other protease that digest the DEVD substrate, since the intensity for 16 hour digestion is the same as the substrate alone. The bar graphs for the DEVD substrate indicate that the activate cell lysate do contain CPP32 protease and this protease activities are inhibited by ZVAD-FMK, known CPP32 protease inhibitor. The contribution of any other proteases in digesting DEVD substrate is very small as indicated by the difference between the intensities of ZVAD-FMK bar to lodoacetamide/PMSF bar.

DETAILED DESCRIPTION Fluoropenic Indicators of Protease Activity

This invention provides for novel fluorogenic molecules useful for detecting protease activity in a sample. The fluorogenic protease indicators of the present invention generally comprise a fluorophore (donor) linked to an “acceptor” molecule by a peptide having an amino acid sequence that is recognized and cleaved by a particular protease. The donor fluorophore typically is excited by incident radiation at a particular wavelength which it then re-emits at a different (longer) wavelength. When the donor fluorophore is held in close proximity to the acceptor molecule, the acceptor absorbs the light re-emitted by the fluorophore thereby quenching the fluorescence signal of the donor molecule. The quench occurs whether the two fluorophores are different or the same species. Thus, in addition to peptides double labeled with two different fluorophores as shown in Example 1, peptides double labeled with the same fluorophore may also be used as protease indicators (see, e.g., Example 6). Cleavage of a well-designed (i.e. a peptide of this invention) joining the donor fluorophore and the acceptor results in separation of the two molecules, release of the quenching effect and increase in fluorescence.

In one basic application, the fluorogenic molecules of this invention may be used to assay the activity of purified protease made up as a reagent (e.g. in a buffer solution) for experimental or industrial use. Like many other enzymes, proteases may loose activity over time, especially when they are stored as their active forms. In addition, many proteases exist naturally in an inactive precursor form (e.g. a zymogen) which itself must be activated by hydrolysis of a particular peptide bond to produce the active form of the enzyme prior to use. Because the degree of activation is variable and because proteases may loose activity over time, it is often desirable to verify that the protease is active and to often quantify the activity before using a particular protease in a particular application.

Previous approaches to verifying or quantifying protease activity involve combining an aliquot of the protease with its substrate, allowing a period of time for digestion to occur and then measuring the amount of digested protein, most typically by HPLC. This approach is time consuming, utilizes expensive reagents, requires a number of steps and entails a considerable amount of labor. In contrast, the fluorogenic reagents of the present invention allow rapid determination of protease activity in a matter of minutes in a single-step procedure. An aliquot of the protease to be tested is simply added to, or contacted with, the fluorogenic reagents of this invention and the subsequent change in fluorescence is monitored (e.g., using a fluorimeter or a fluorescence microplate reader).

In addition to determining protease activity in “reagent” solutions, the fluorogenic compositions of the present invention may be utilized to detect protease activity in biological samples. The term “biological sample”, as used herein, refers to a sample obtained from an organism or from components (e.g., cells) of an organism. The sample may be of any biological tissue or fluid. Frequently the sample will be a “clinical sample” which is a sample derived from a patient. Such samples include, but are not limited to, sputurn, blood, blood cells (e.g., white cells), tissue or fine needle biopsy samples, urine, peritoneal fluid, and pleural fluid, or cells therefrom. Biological samples may also include sections of tissues such as frozen sections taken for histological purposes.

Previously described fluorogenic protease indicators typically absorb light in the ultraviolet range (e.g., Wang et al., supra.). They are thus unsuitable for sensitive detection of protease activity in biological samples which typically contain constituents (e.g., proteins) that absorb in the ultraviolet range. In contrast, the fluorescent indicators of the present invention both absorb and emit in the visible range (400 nm to about 750 nm). These signals are therefore not readily quenched by, nor is activation of the fluorophores, that is, absorption of light, interfered with by background molecules; therefore they are easily detected in biological samples.

In addition, unlike previous fluorogenic protease indicators which often utilize a fluorophore and a quenching chromophore, the indicators of the present invention may utilize two fluorophores (i.e., fluorophore as both donor and acceptor) or the same two fluorophores effectively forming a ground-state dimer when joined by the one of the peptide backbones of this invention. Pairs of fluorophores may be selected that show a much higher degree of quenching than previously described chromophore/fluorophore combinations. In fact, previous compositions have been limited to relatively low efficiency fluorophores because of the small degree of quenching obtainable with the matching chromophore (Wang et al. supra.). In contrast, the fluorogenic protease indicators of this invention utilize high efficiency fluorophores and are able to achieve a high degree of quenching while providing a strong signal when the quench is released by cleavage of the peptide substrate. The high signal allows detection of very low levels of protease activity. Thus the fluorogenic protease indicators of this invention are particularly well suited for in situ detection of protease activity.

The fluorogenic protease indicators of the present have the general formula:

where P is a peptide comprising a protease binding site, F¹ and F² are fluorophores, C¹ and C² are conformation determining regions, and S¹ and S² are optional peptide spacers. F¹ may be the donor fluorophore while F² is the acceptor fluorophore, or conversely, F² may be the donor fluorophore while F¹ is the acceptor fluorophore, or F¹ and F² may be identical. The protease binding site provides an amino acid sequence (a peptide) that is recognized and cleaved by the protease whose activity the indicator is designed to reveal. The protease binding site is typically a peptide ranging in length from 2 amino acids to about 12 amino acids, 2 to about 10, 2 to about 8, 2 to about 6 or 2 to about 4 amino acids in length.

The conformation determining region is an amino acid sequence that introduces a bend into the molecule or otherwise restricts the degrees of freedom of the peptide backbone. The combined effect of the two conformation determining regions is to juxtapose the fluorophores attached to the amino and carboxyl termini of C¹ and C² respectively. The fluorophores are thus preferably positioned adjacent to each other at a distance less than about 100 angstroms. The fluorophores (F¹ and F²) are typically conjugated directly to the conformation determining regions, although they may be joined by linkers. The optional spacers (S¹ and S²) when present, are used to link the composition to a solid support or to anchor the composition to a component of a biological sample (e.g., to a cellular membrane).

The substantially conformation determining regions increases the protease specificity of the composition. The amino acid sequences comprising the conformation determining regions are typically less accessible to the enzyme due to steric hinderance with each other and with the attached fluorophores. Conversely, the protease binding site is relatively unobstructed by either the fluorophore or the conformational determining region and is thus readily accessible to the protease.

Protease Binding Site and Conformation Determining Regions

The protease binding site and conformation determining regions form a contiguous amino acid sequence (peptide). The protease binding site is an amino acid sequence that is recognized and cleaved by a particular protease. It is well known that various proteases cleave peptide bonds adjacent to particular amino acids. Thus, for example, trypsin cleaves peptide bonds following basic amino acids such as arginine and lysine and chymotrypsin cleaves peptide bonds following large hydrophobic amino acid residues such as tryptophan, phenylalanine, tyrosine and leucine. The serine protease elastase cleaves peptide bonds following small hydrophobic residues such as alanine.

A particular protease, however, will not cleave every bond in a protein that has the correct adjacent amino acid. Rather, the proteases are specific to particular amino acid sequences which serve as recognition domains for each particular protease. Without being bound by a particular theory, it is believed that a specific protease's preference for a particular cleavage site over many other potential sites in a folded globular protein may be largely determined by the potential cleavage site's amino acid sequences and also their conformation and conformational flexibility.

Thus, for example, one obtains limited proteolysis products, e.g., ribonuclease-S (a noncovalent complex consisting of two polypeptide chains) from a single chain folded protein ribonuclease-A using a protease called subtilisin. Similarly, one obtains a two chain noncovalent complex, Staphylococal nuclease-T, from single chain Staphylococcal nuclease by trypsin digestion. Another example of a specific protease's preference for one substrate over others is the human fibroblast-type collagenase. This protease prefers type I over type III soluble collagen even though both subsirates contain the same collagenase sensitive Gly-Ile or Gly-Leu bonds (see, e.g., Birkedal-Hansen et. al. (1993) Crit. Rev. in Oral Biology and Medicine 4:197-250).

Any amino acid sequence that comprises a recognition domain and can thus be recognized and cleaved by a protease is suitable for the “protease binding site” of the fluorogenic protease indicator compositions of this invention. Known protease substrate sequences and peptide inhibitors of proteases posses amino acid sequences that are recognized by the specific protease they are cleaved by or that they inhibit. Thus known substrate and inhibitor sequences provide the basic sequences suitable for use in the protease recognition region. A number of protease substrates and inhibitor sequences suitable for use as protease binding domains in the compositions of this invention are indicated in Table 2. One of skill will appreciate that this is not a complete list and that other protease substrates or inhibitor sequences may be used.

The amino acid residues comprising the protease binding site are, by convention, numbered relative to the peptide bond hydrolyzed by a particular protease. Thus the first amino acid residue on the amino side of the cleaved peptide bond is designated P₁ while the first amino acid residue on the carboxyl side of the cleaved peptide bond is designated P₁′. The numbering of the residues increases with distance away from the hydrolyzed peptide bond. Thus a four amino acid protease binding region would contain amino acids designated: P₂—P₁—P₁′—P₂′ and the protease would cleave the binding region between P₁ and P₁′.

In a preferred embodiment, the protease binding region of the fluorogenic protease indicators of the present invention is selected to be symmetric about the cleavage site. Thus, for example, where a binding region is Ile—Pro—Met—Ser—ILe (e.g. α-1 anti-trypsin) and the cleavage occurs between Met and Ser then a four amino acid residue binding region based on this sequence would be: —P₂—P₁—P₁′—P₂′— —Pro—Met—Ser—ILe— Other examples of binding domains selected out of longer sequences are provided in Table 2. The remaining amino or carboxyl residues that are not within the protease binding domain may remain as part of the conformation determining regions subject to certain limitations as will be explained below. Thus, in the instant example, the amino terminal Ile may be incorporated into the C¹ conformation determining region.

Various amino acid substitutions may be made to the amino acids comprising the protease binding domain to increase binding specificity, to eliminate reactive side chains, or to reduce the conformational entropy (decrease degrees of freedom) of the molecule. Thus, for example, it is often desirable to substitute methionine (Met) residues, which bear a oxidizable sulfur, with norleucine. Thus, in the example given, a preferred protease binding region will have the sequence:

 —P₂—P₁—P₁′—P₂′— —Pro—Nle—Ser—ILe—

Conformation Determining Regions

Conformation determining regions (C¹ and C²) are peptide regions on either end of the protease cleavage region that both stiffen and introduce bends into the peptide backbone of the fluorogenic protease indicator molecules of this invention. The combination of the two conformation determining regions and the relatively straight protease cleavage region produces a roughly U-shaped molecule with the cleavage site at the base (middle) of the “U”. The term U-shaped is, of course, approximate, the point being that, as described below, the fluorophores are held relatively rigidly in close juxtaposition (e.g., less than about 100 angstroms).

In one preferred embodiment, amino acids such as proline (Pro) and α-aminoisobutyric acid (Aib) are selected both to introduce bends into the peptide molecule and to increase the rigidity of the peptide backbone. The C¹ and C² domains are selected such that the “arms” of the U are rigid and the attached fluorophores are localized adjacent to each other at a separation of less than about 100 angstroms. In order to maintain the requisite stiffness of the peptide backbone and placement of the fluorophores, the conformation determining regions are preferably 4 amino acids in length or less, or alternatively are greater than about 18 amino acids in length and form a stable alpha helix conformation or a β-pleated sheet.

A) Tetrapeptide Binding Site Compositions.

In a preferred embodiment, the peptide backbone of the fluorogenic protease indicators of the present invention will comprise a tripeptide C¹ region, a tetrapeptide P region and a single amino acid or dipeptide C² region. These compounds may be represented by the formula:

where Y is either

In these formulas the peptide binding region is designated —P₂—P₁—P₁′—P₂′—, while the amino acid residues of conformation determining regions C¹ and C² are designated —C¹ ₅—C¹ ₄—C¹ ₃— and —C² ₃—C² ₄— respectively. The C² region may either be an amino acid or a dipeptide. Whether the C² region is a dipeptide or an amino acid, the F² fluorophore and the S² spacer, when present, are always coupled to the carboxyl terminal residue of C². When a spacer is present on the C² region, it is attached the carboxyl terminal residue of C² by a peptide bond to the a carboxyl group.

As indicated above, the conformation determining regions typically contain amino acid residues such as a proline (Pro) that introduce a bend into the molecule and increase its stiffness. One of skill in the art will appreciate, however that where the terminal residues of the protease binding region (P) are themselves bend-creating residues such as proline, it is not necessary to locate a bend-creating residue at the position closest to P in the C region attached to that terminus. The conformation determining regions are thus designed by first determining the protease binding region, as described above, determining the “left-over” residues that would lie in the conformation determining regions, and if necessary, modifying those residues according to the following guidelines:

-   -   1. If the P₂′ site is not a Pro then C² is a dipeptide         (Formula III) Pro-Cys, Aib-Cys, Pro-Lys, or Aib-Lys, while         conversely, if the P₂′ site is a Pro then C² is a single amino         acid residue (Formula IV) Cys or Lys.     -   2. If the P₂ site is not a Pro then C¹ is a tripeptide         consisting of Asp-C¹ ₄-Pro, Asp-C¹ ₄-Aib, Asp-Aib-Pro,         Asp-Pro-C¹ ₃, Asp-Aib-C¹ ₃, Asp-Pro-Aib, or Asp-Aib-Aib, while         if the P₂ site is a Pro residue then group C¹ is a tripeptide         consisting of Asp-C ¹ ₄—C¹ ₃ or Asp-C¹ ₄-Aib.     -   3. If the P₃ (C¹ ₃) residue is a Pro then C¹ is a tripeptide         consisting of Asp-C¹ ₄-Pro or Asp-Aib-Pro.     -   4. If the P₄ (C¹ ₄) residue is a Pro then C¹ is a tripeptide         consisting of Asp-Pro-C¹ ₃ or Asp-Pro-Aib.     -   5. If P₂ and C¹ ₃ are both not prolines then C¹ is a tripeptide         consisting of Asp-Pro-C¹ ₃, Asp-Aib-C¹ ₃, Asp-C¹ ₄-Pro, Asp-C¹         ₄-Aib, Asp-Pro-Aib, or Asp-Aib-Pro.         As indicated above, any methionine (Met) may be replaced with a         norleucine (Nle). A number of suitable peptide backbones         consisting of C¹, P and C² are provided in Table 2.

TABLE 2 Illustration of the design of the conformation determining regions and protease binding site based on known protease substrate and inhibitor sequences. Italics indicate residues that are added to create a bend and to increase rigidity of the conformation determining regions. Normal font indicates residues of the substrate or inhibitor that forms the protease binding site. The thick line indicates the location at which the protease binding site is cleaved. CDR (C¹) Protease Binding Site (P) CDR (C²) Substrate/Inhibitor C¹ ₅ C¹ ₄ C¹ ₃ P₂ P₁ P₁′ P₂′ C² ₃ C² ₄ α-1 anti-trypsin Asp Ala Ile Pro Met Ser Ile Pro Cys Nle Aib Lys plasminogen Asp Met Thr Gly Arg Thr Gly Pro Cys activator inhibitor 2 Aib Aib Aib Lys Pro Pro neutrophil leukocyte Asp Ala Thr Phe Cys Met Leu Pro Cys elastase inhibitor Aib Aib Nle Aib Lys Pro anti-plasmin Asp Aib Ser Arg Met Ser Leu Pro Cys inhibitor Aib Nle Aib Lys Pro anti α-1 thrombin Asp IleA Ala Gly Arg Ser Leu Pro Cys ib Aib Aib Lys Pro α-1 Asp Aib Thr Leu Leu Ser Leu Pro Cys antichymotrypsin Aib Aib Lys Pro interstitial type III Asp Gly Pro Leu Gly Ile Ala Pro Cys (human liver) Aib Aib Aib Lys collagen type I collagen for Asp Gly Pro Gln Gly Ile Leu Pro Cys collagenase Bovine Aib Aib Aib Lys α 1 Pro type I collagen chick Asp Gly Pro Gln Gly Leu Leu Pro Cys α2 Aib Aib Aib Lys Pro human α1 type II Asp Gly Pro Gln Gly Ile Ala Pro Cys collagen Aib Aib Aib Lys Pro type III collagen - Asp Gly Pro Gln Ala Ile Ala Pro Cys AIA Aib Aib Aib Lys Pro type III collagen Asp Gly Pro Gln Gly Ile Ala Pro Cys (human skin) Aib Aib Aib Lys Pro human α 2 Asp Gly Pro Glu Gly Leu Arg Pro Cys macroglobulin Aib Aib Aib Lys Pro stromelysin cleavage Asp Asp Val Gly His Phe Arg Pro Cys sites of stromelysin- Aib Aib Aib Lys 1d Pro Pro stromelysin cleavage Asp Asp Thr Leu Glu Val Met Pro Cys sites of stromelysin- Aib Aib Nle Aib Lys 1 Pro Pro stromelysin cleavage Asp Arg Ala Ile His Ile Gln Pro Cys site of proteoglycan Aib Aib Aib Lys link protein Pro Pro gelatinase type IV Asp Asp Val Ala Asn Tyr Asn Pro Cys collagenase site of Aib Aib Aib Lys 72K gelatinases Pro Pro gelatinase type IV Asp Gly Pro Ala Gly Glu Arg Pro Cys cleavage of gelatin Aib Aib Aib Lys Pro gelatinase type IV Asp Gly Pro Ala Gly Phe Ala Pro Cys cleavage of gelatin Aib Aib Aib Lys Pro type III collagen Asp Gly Pro Gln Gly Leu Ala Pro Cys (human skin) Aib Aib Aib Lys Pro Human FIB-CL Asp Asp Val Ala Gln Phe Val Pro Cys propeptide Aib Aib Aib Lys Pro Pro Cathepsin D Asp Asp Gly His Phe Leu Arg Pro Cys (Thyroglobulin Aib Pro Aib Lys Fragment Tg1) Pro Aib Cathepsin D Asp Thr Thr Glu Leu Phe Ser Pro Cys (Thyroglobulin Aib Pro Aib Lys Fragment Tg2) Pro Aib Cathepsin D Asp Lys Phe leu Ala Phe Leu Pro Cys (Thyroglobulin Aib Pro Aib Lys Fragment Tg3) Pro Aib Cathepsin D Asp Phe Ser His Phe Val Arg Pro Cys (Thyroglobulin Aib Pro Aib Lys Fragment Tg4) Pro Aib Prostate Specific Asp Gln Gln Leu Leu His Asn Pro Cys Antigen (PSA) Aib Pro Aib Lys (Seminolgelin, Sg) Pro Aib Sg1 Prostate Specific Asp Ser Ile Gln Tyr Thr Tyr Pro Cys Antigen (PSA) Aib Pro Aib Lys (Seminolgelin, Sg) Pro Aib Sg2 Prostate Specific Asp Ser Ser Gln Tyr Ser Asn Pro Cys Antigen (PSA) Aib Pro Aib Lys (Seminolgelin, Sg) Pro Aib Sg3 Prostate Specific Asp Ser Ser Ile Tyr Ser Gln Pro Cys Antigen (PSA) Aib Pro Aib Lys (Seminolgelin, Sg) Pro Aib Sg4 Gelatin α1 (type 1) Asp Gly Pro Ala Gly Val Gln Pro Cys Aib Aib Aib Lys Pro ¹In a preferred embodiment, the sequence may be followed by an S₂ spacer of Gly-Tyr. Thus, for example, where C² ₄ is Lys, C² ₄-S₂ is Lys-Gly-Tyr. Indicators Having Other Binding Sites.

In another preferred embodiment, the binding site (P) ranges from 2 to about 12 amino acids in length. It was a discovery of this invention, that somewhat larger conformation determining regions can sufficiently restrict the degrees of freedom of the indicator molecule, that the fluorophores are suitably quenched regardless of the amino acid sequence of the binding (recognition) domain (P). In one preferred embodiment, these compositions are include the compounds represented by the Formula V:

In this formula, P is a peptide comprising a protease binding site and consists of 2 to about 12 amino acids, F¹ and F² are fluorophores where F¹ is attached to the amino terminal amino acid and F² is attached to the carboxyl terminal amino acid of the composition (excluding spacers). S¹ and S², when present, are peptide spacers ranging in length from 1 to about 50 amino acids and S¹, when present, is attached to the amino terminal amino acid, while S², when present, is attached to the carboxyl terminal amino acid. The subscripts i, j, k, l, m, n, o, p, q, and r are independently 0 or 1.

In a particularly preferred embodiment, aa¹ and aa¹⁰ are independently selected from the group consisting of lysine, ornithine and cysteine; aa², aa³, aa⁸ and aa⁹ are independently selected from the group consisting of an amino acid or a dipeptide consisting of Asp, Glu, Lys, Ornithine, Arg, Citulline, homocitrulline, Ser, bomoserine, Thr, and Tyr; aa⁵, aa⁴, aa⁶, and aa⁷ are independently selected from the group consisting of proline, 3,4-dehydroproline, hydroxyproline, alpha aminoisobutyric acid and N-methyl alanine; X is selected from the group consisting of Gly, βAla, γAbu, Gly-Gly, Ahx, βAla-Gly, βAla-βAla, γAbu-Gly, βAla-γAbu, Gly-Gly-Gly, γAbu-γAbu, Ahx-Gly, βAla-Gly-Gly, Ahx-βAla, βAla-βAla-Gly, Gly-Gly-Gly-Gly, Ahx-γAbu, βAla-βAla-βAla, γAbu-βAla-Gly, γAbu-γAbu-Gly, Ahx-Ahx, γAbu-γAbu-βAla, and Ahx-Ahx-Gly; Y is selected from the group consisting of Gly, βAla, γAbu, Gly-Gly, Ahx, Gly-βAla, βAla-βAla, Gly-γAbu, γAbu-βAla, Gly-Gly-Gly, γAbu-γAbu, Gly-Ahx, Gly-Gly-βAla, βAla-Ahx, Gly-βAla-βAla, Gly-Gly-Gly-Gly, γAbu-Ahx, βAla-βAla-βAla, Gly-βAla-γAbu, Gly-γAbu-γAbu, Ahx-Ahx, βAla-γAbu-γAbu, and Gly-Ahx-Ahx.

When i is 1, S¹ is joined to aa¹ by a peptide bond through a terminal alpha amino group of aa¹; and when r is 1, S² is joined to aa¹⁰ by a peptide bond through a terminal alpha carboxyl group of aa¹⁰. It will be appreciated that amino acids 1-4 or 7-10 may be absent. When one or more of these amino acids are absent, the fluorophores are attached to the remaining terminal amino acids.

The amino acid backbones of such particularly preferred compositions are listed in Tables 3 and 4.

Table 4. Illustration of the design of the conformation determining regions and protease binding sites in molecules having P domains larger that 4 amino acids. The P1 residue is underlined. Z is benzyloxycarbonyl group, K(TFA)means Lys(N(epsilon)trifluoroacetyl), Fm is Fmoc (preferably attached to the alpha amino group of the amino terminal residue e.,g., Lysine (K). O indicates tetrahydroisoquinoline-3-carboxylic acid. Aib, designated as B, can be replaced by Pro. J is a C

TABLE 3 Illustration of the design of the conformation determining regions and protease binding sites in molecules having P domains larger that 4 amino acids. The P1 residue is underlined. Z is benzyloxycarbonyl group, K[TFA] means Lys (N(epsilon)trifluoroacetyl), Fm is Fmoc (preferably attached to the alpha amino group of the amino terminal residue e.,g., Lysine (K). O indicates tetrahydroisoquinoline-3-carboxylic acid. Aib, designated as B, can be replaced by Pro. SEQ aa²- aa⁸- ID Name aa¹ aa³ aa⁴ aa⁵ X P Y aa⁶ aa⁷ aa⁹ aa¹⁰ S² NO PAI-2 K D B TGRTG P K GY 1 PAI-2(b) K D P P TGRTG P P K GY 2 DEVD K D B DEVDGID P K GY 3 DevN K D B DEVNGID P K GY 4 PARP K D B EVDGID P K GY 5 ICE K DY B ADGID P K GY 6 Fm-DEVD Fm-K D B DEVDGID P K GY 7 Fm-DEVN Fm-K D B DEVNGID P K GY 8 Fm-PARP Fm-K D B EVDGID P K GY 9 Fm-KNFES Fm-K D — AIPMSI P K GY 10 Fm-K D AIPNluSI P K GY 11 Fm-G2D2D Fm-K D B GDEVDGID G P K GY 12 Fm-CGD2D Fm-K D B J GDEVDGID GJ P K GY 13 Z-CGD2D Z-K D B J GDEVDGID GJ P K GY 14 Fm-ICE Fm-K DY B ADGID P K GY 15

TABLE 4 Illustration of the design of the conformation determining regions and protease binding sites in molecules having P domains larger that 4 amino acids. The P1 residue is underlined. Z is benzyloxycarbonyl group. K[TFA] means Lys (N(epsilon)trifluoroacetyl), Fm is Fmoc (preferable attached to the alpha amino group of the amino terminal residue e.,g., Lysine (K). O indicates tetrahydroisoquinoline-3-carboxylic acid. Aib, designated as B, can be replaced by Pro. J is epsilon-aminocaproic acid. Substrate aa²- aa⁸- SEQ class aa¹ aa³ aa⁴ aa⁵ X P Y aa⁶ aa⁷ aa⁹ aa¹⁰ S² ID NO CPP32 substrates (preferably with DER and TMR fluorophores). Note where Fmoc (Fm) is indicated, it is optional, and where not indicated it can be added. Fa-K D P JG DEVDGIN GJ P K GY 261 Fm-K D P JG DEVDGIN GJ P K 262 amide Fm-K D P JG (d-O)DEVDGIN GJ P K GY 263 Fm-K D P JG DEVDGIN G P K GY 264 Fm-K D P G DEVDGIN GJ P K GY 265 Fm-K D P JG DEVDGID GJ P K 266 amide Fm-K D P JG EEVEGIN GJ P K GY 267 Fm-K D P JG D(dF)VDGIN GJ P K GY 268 Fm-K D P JG (d-D)EV(d-D)GIN GJ P K GY 269 Fm-K D P JG DEVDGIN GJ P K GY 270 Fm-K DB JG DEVNGIN GJ P K GY 271 Fm-K DB JG DEVDGID GJ P K GY 272 Fm-K DB JG DEVDGIN GJ P K GY 273 Fm-K DB JG DEVNGID GJ P K GY 274 K D B JJ GDEVDGID JJ P K GY 275 K D B J GNEVDGID GJ P K GY 276 K D B J GDEVDGIN GJ P K GY 277 K D B J GNEVDGIN GJ P K GY 278 K D B J GDEVNGIN GJ P K GY 279 K D B J GNEVNGIN GJ P K GY 280 K D B JG ODEVDGID GJ P K GK 281 K D B JG dODEVDGID GJ P K GY 282 K D B JG WDEVDGID GJ P K GY 283 K D B JG dWDEVDGID GJ P K GY 284 K D B JG dOdODEVDGID GJ P K GY 285 K D B JG dWdWDEVDGID GJ P K GY 286 K D B YVADGID P K GY 287 K D B YVADGIN P K GY 288 K D B YVANGIN P K GY 289 K D B G YVADGID G P K GY 290 K D B G YVADGIN G P K GY 291 K D B G YVANGIN G P K GY 292 K D B JG YVADGID GJ P K GY 293 K D B JG YVANGID GJ P K GY 294 K D B JG YVANGIN GJ P K GY 295 K D B JG YVADGIN GJ P K GY 296 K D B JG dYVADGIN GJ P K GY 297 LAMIN-A Fm-K D P JG LVEIDNG J P K GY 298 Fm-K DP JG LVEIENG J P K GY 299 K D B LVEIDNG P K GY 300 K D B G LVEIDNG G P K GY 301 K D B JG LVEIDNG GJ P K GY 302 K D B JG LVEINNG GJ P K GY 303 ProCPP32Asp175 Fm-K D P J GIETESGV GJ P K GY 304 Fm-K D P J GIETDSG J P K GY 305 Fm-K D P J GIETESG J P K GY 306 K D B GIETDSGVDD P K GY 307 K D B GIETNSGVDD P K GY 308 K D B G GIETDSGVDD G P K GY 309 K D B G GIETNSGV G P K GY 310 K D B J GIETDSGV J P K GY 311 K D B J GIETNSGV J P K GY 312 K D B JG GIETDSGV GJ P K GY 313 K D B JG GIETNSGV GJ P K GY 314 ProCPP32Asp28 K D B GSESMDSGISLD P K GY 315 K D B G GSESMDSG G P K GY 316 K D B JG GSESMDSG GJ P K GY 317 NS3 NS5A/5B K D B JG DVVCCSMS GJ P K GY 318 K D B JG DVVCDSMS GJ P K GY 319 K D B JG DVVCCSdMS GJ P K GY 320 K D B JG DVVCDSdMS GJ P K GY 321 K D B JG DVVCCPdMS GJ P K GY 322 K D B JG EDVVCCS GJ P K GY 323 K D B JG EDVVCDS GJ P K GY 324 K D B JG EDdVVCCP GJ P K GY 325 K D B JG EDdVVCDP GJ P K GY 326 K D B JG DdVVCCSdMS GJ P K GY 327 K D B JG DVdVCDSdMS GJ P K GY 328 K D B JG DdVVCCPdMS GJ P K GY 329 K D B JG DVVCCSM GJ P K GY 330 K D B JG DVVCDSM GJ P K GY 331 K D B JG VCCSM GJ P K GY 332 K D B JG VCDSM GJ P K GY 333 NS3 NS4A/4B K D B JG DEMEECSQHL P K GY 334 K D B JG DEMEECPQHL P K GY 335 K D B JG DEMEEDSQHL P K GY 336 K D B JG EMEECSQHL P K GY 337 K D B JG EMEECPQHL P K GY 338 K D B JG EMEEDSQHL P K GY 339 K D B JG EMEECSQHL G P K GY 340 K D B JG EMEECPQHL G P K GY 341 K D B JG EMEEDSQHL G P K GY 342 K D B JG EMEECSQHL GJ P K GY 343 K D B JG EMEECPQHL GJ P K GY 344 K D B JG EMEEDSQHL GJ P K GY 345 Ext. PAI-2 K D B JG VMTGRTG J P K GY 346 K D B JG VdMTGRTG J P K GY 347 K D B JG VMTGRTG J P K GY 348 K D B JG VMTGRTG J P K GY 349 THROMB K D B JG VMTGRG J P K GY 350 K D B JG VMTGRG GJ P K GY 351 K D B JG VdmTGRG GJ P K GY 352 Urokinase Fm-K D P J TGRT 353 Fm-D P TGRT G P K GY 354 Fm-K D P VMTGRT GJ P K GY 355 Fm-K D P TGRT GJ P K GY 356 Fm-K D P JG TGRT GJ P K GY 357 Fm-K D P JG TGRT G P K GY 358 Fm-K D P G TGRT G P K GY 359 K D P J TGRTG J P K GY 360 K D P C3 TGRTG P K GY 361 K D P C7 TGRTG P K GY 362 K D B JG VMTGRVG J P K GY 363 K D B JG VdMTGRVG J P K GY 364 F12A K D B JG VMTGRAG J P K GY 365 K D B JG VdMTGRAG J P K GY 366 Swedish KM/NL AMLOID Fm-K D P JG SEVKLDAEFGC5PKGY GJ P K GY 367 Fm-K D P JG S(d-E)VK(d-L)DAE(d-F) GJ P K GY 368 Fm-K D P JG S(d-E)VK(d-L)DAE(d-F) GJ P K GY 369 K D B JG SEVNLDAEF GJ P K DDY 370 K D B JG SEVKLDAEF GJ P K DDY 371 NATIVE AMYLOID K D B JG SEVKMDAEF GJ P K DDY 372 CATHESPSIN G K D B JG SEVKMDDEF GJ P K DDY 373 K D B JG SEVNLDDEF GJ P K DDY 374 APP[709-710] K D B JG GVVIATVIVIT GJ P K DDY 375 APP[708-719] K D B JG YGVVIATVIVIT GJ P K DDY 376 APP[711-716] K D B JG VIATVI GJ P K DDY 377 APP[708-713] K D B JB YGVVIA GJ P K DDY 378 PSA Sg1 K D B JJ QQLLHN JJ P K 379 K D B JG QQLLHN GJ P K 380 K D B G QQLLHN G P K 381 K D B QQLLHN P K 382 PSA Sg2 K D B JJ SIQYTY JJ P K 383 K D B JG SIQYTY GJ P K 384 K D B G SIQYTY G P K 385 K D B SIQYTY P K 386 PSA Sg3 K D B JJ SSQYSN JJ P K 387 K D B JG SSQYSN GJ P K 388 K D B G SSQYSN G P K 389 K D B SSQYSN P K 390 PSA Sg4 K D B JJ SSIYSQ JJ P K 391 K D B JG SSIYSQ GJ P K 392 K D B G SSIYSQ G P K 393 K D B SSIYSQ P K 394 Cathepsin D substrates (preferably with diethylrhodamine fluorophore, note fmoc (Fm) is optional) Fm-K D P JG SEVNLDAEF GJ P K GY 395 Caspase-9 Fm-K D P JG LEHDGIN GJ P K GY 396 Caspase-8 Fm-K D P JG LETDGIN GJ P K GY 397 Caspase-1 Fm-K D P JG WEHDGIN GJ P K GY 398 Fm-K D P JG YVHDG J P K GY 399 Fm-K D P JG YVHDGIN GJ P K GY 400 Fm-K D P JG YVHDA P K GY 401 Granzyme B Fm-K DP JG IEPDS GJ P K GY 402 Collagenase Fm-K DP JG PLGIAGI GJ P K GY 403 HIV-1 protease Fm-K DP JG SQNYPIVQ GJ P K GY 404 Hepatitis C protease Fa-K DP JG EDVVCCS GJ P K GY 405 * In certain embodiments, the Fm or Fa groups identified in the above sequences are optional or can be substituted with other hydrophobic groups. Conversely any of the sequences listed without a hydrophobic group can have one added. In addition, in certain embodiments, the carboxyl terminal amino acid can have the carboxylic acid group replaced with an amide

“Donor” and “Acceptor” Fluorophores

A fluorophore excited by incident radiation absorbs light and then subsequently re-emits that light at a different (longer) wavelength. However, in the presence of a second class of molecules, known as “acceptors” the light emitted by a so-called donor fluorophore is absorbed by the acceptor thereby quenching the fluorescence signal of the donor. Thus, use of two fluorophores, as opposed to a fluorophore/chomophore pair, allows a clearer assessment of the overlap between the emission spectrum of the donor and the excitation spectrum of the acceptor. This facilitates the design of a peptide backbone that allows allowing optimization of the quenching. This results in a high efficiency donor/acceptor pair facilitating the detection of low concentrations of protease activity. Thus, although a fluorophore/chromophore combination may be suitable, in a preferred embodiment, the fluorogenic protease inhibitors of this invention will comprise two fluorophores.

The “donor” and “acceptor” molecules are typically selected as a matched pair such that the absorption spectra of the acceptor molecule overlaps the emission spectrum of the donor molecule as much as possible. In addition, the donor and acceptor fluorophores are preferably selected such that both the absorption and the emission spectrum of the donor molecule is in the visible range (400 nm to about 700 nm). The fluorophores thereby provide a signal that is detectable in a biological sample thus facilitating the detection of protease activity in biological fluids, tissue homogenates, in situ in tissue sections, and the like. The emission spectra, absorption spectra and chemical composition of many fluorophores are well known to those of skill in the art (see, for example, Handbook of Fluorescent Probes and Research Chemicals, R. P. Haugland, ed. which is incorporated herein by reference).

Preferred fluorophore pairs include the rhodamine derivatives. Thus, for example 5-carboxytetramethylrhodamine or the succinimidyl ester of 5- and/or 6-carboxytetramethylrhodamine (9-(2,5-dicarboxyphenyl)-3,6-bis-(dimethylamino)xanthylium chloride (5-TMR) and 9-(2,6-dicarboxyphenyl)-3,6-bis-(dimethylamino)xanthylium chloride (6-TMR), C2211 and C1171 respectively, available from Molecular Probes, Eugene, Oreg., USA) (Formula VI) is a particularly preferred donor molecule

and rhodamine X acetamide (R 492 from Molecular Probes) (Formula VII)

or the succinimidyl ester of 5- and/or 6-carboxy-X-rhodamine (9-(2,5-dicarboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthene (5-DER) and 9-(2,6-dicarboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthene (6-DER), mixed isomer available as C11309 (designated herein as DER) from Molecular Probes) is a particularly preferred receptor molecule. These fluorophores are particularly preferred since the excitation and emission of both members of this donor/acceptor pair are in the visible wavelengths, the molecules have high extinction coefficients and the molecules have high fluorescence yields in solution. The extinction coefficient is a measure of the light absorbance at a particular wavelength by the chromophore and is therefore related to its ability to quench a signal, while the fluorescence yield is the ratio of light absorbed to light re-emitted and is a measure of the efficiency of the fluorophore and thus effects the sensitivity of the protease indicator.

Other preferred fluorophores include, but are not limited to rhodamine X, 9-(2,5 (or 2,6)-dicarboxyphenyl)-3,6-bis(dimethylamino)xanthyliumhalide or other anion (TMR), 9-(2,5)-dicarboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthylium halide or other anion (Rh6G), 9-(2,6)-dicarboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthylium halide or other anion, 9-(2,5 (or 2,6)dicarboxyphenyl)-3,6-bisamino-xanthylium halide or other anion (Rh110), 9-(2,5 (or 2,6)-dicarboxyphenyl)-3-amino-6-hydroxy-xanthylium halide or other anion (Blue Rh), 9-(2-carboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthylium, 9-(2-carboxyphenyl)-3,6-bis(dimethylamino)xanthylium, and 9-(2-carboxyphenyl)-xanthylium.

Of course, while not most preferred, fluorophores that absorb and emit in the ultraviolet may also be used in the protease indicators of the present invention. One particularly preferred ultraviolet absorbing pair of fluorophores is 7-hydroxy-4-methylcoumarin-3-acetic acid as the donor molecule (Formula VIII)

and 7-diethylamino-3-((4′-iodoacetyl)amino)phenyl)-4-methylcournarin (Formula IX) as the acceptor molecule.

These and other fluorophores are commercially available from a large number of manufacturers such as Molecular Probes (Eugene, Oreg., USA).

It was a surprising discovery that fluorophores having matched absorption and emission spectra are not required in the practice of the present invention. In fact, a single species of fluorophore, when joined to the polypeptide backbones of this invention in the positions occupied by F¹ and F², is capable of quenching itself. Moreover, this quenching is fully released when the peptide backbone is cleaved.

Without being bound to a particular theory, it is believed that quenching is accomplished by the formation of ground state dimers wherein the electron orbitals of the two fluorophores interact resulting in reciprocal quenching. It is the limited conformational entropy of the peptide backbones of this invention that forces fluorophores into close enough proximity to effectively form a ground state dimer.

Particularly preferred molecules form H-type dimers. The formation of H-type dimers by fluorescent molecules is described by Packard et al. (1996) Proc. Natl. Acad. Sci. USA, 93: 11640-11645. The H-type dimer is characterized by exciton bands in the absorption spectra and fluorescence quenching (see, e.g., Valdes-Aguilera et al. (1989) Acc. Chem. Res., 22: 171-177 and Packard et al. (1996) Proc. Natl. Acad. Sci. USA, 93: 11640-11645).

Thus, in a preferred embodiment, the protease indicators of this invention include only a single species of fluorophore, more preferably a fluorophore capable of forming H-type dimers.

NorFes is an undecapeptide that contains a recognition sequence and cleavage site for the serine protease elastase. When NorFes was doubly labeled with a variety of fluorophores on opposite sites of the amino acid sequence, the fluorescence was quenched due to formation of intramolecular ground-state dimers. The spectral characteristics of these dimers were predictable by exciton theory.

The decrease in dimer/monomer ratios as the temperature was raised indicated an intermolecular attraction between the dye molecules. The free energy of activation of disruption of homodimers composed of tetramethylrhodamine was at least 1.7 kcal/mole and for those of diethylrhodamine was 2.4 kcal/mole. Because of the intermolecular attraction of fluorophores that form exciton dimers the connecting amino acid sequences can deviate from the optimal sequences described herein. Thus, when exciton-forming fluorphores are used, amino acid substitutions can be made in the Abackbones@ described herein and activity can still be maintained.

Particularly preferred exciton-forming fluorophores include carboxytetramethylrhodamine, carboxyrhodamine-X, diethylaminocoumarin, and carbocyanine dyes. In this embodiment, there is no need to match emission or absorption spectra since only a single fluorophore is used. Thus a wide variety of fluorophores can be used effectively. In addition, the use of a single fluorophore greatly simplifies synthesis chemistry.

The use of homo-doubly labeled indicators (indicators labeled with a single species of fluorophore) of this invention also permits detection of enzymatic activity by absorbance measurements in addition to fluorescence measurements. Since blue-shifted exciton bands (or blue-shifted absorption maxima) in absorption spectra denote H-dimer formation and fluorescence quenching is concomitant with the latter, measurement of absorption spectra may be sufficient as a diagnostic tool in the proper setting. When a doubly labeled protease indicator is cleaved by a specific protease, the H-type dimer is disrupted. The blue shifted absorption maximum associated with the H-type dimer is then lost. Hence, if one measures the intensity of absorption at this blue shifted exciton band then as the H-type dimer is disrupted the absorption intensity is expected to decreased whereas the absorption intensity at the monomer maximum peak wavelength is expected to increase.

Preparation of Fluoropenic Protease Indicators

The fluorogenic protease indicators of the present invention are preferably prepared by first synthesizing the peptide backbone, i.e. the protease cleavage site (P), the two conformation determining regions (C¹ and C²), and the spacers (S¹ and S²) if present. The fluorophores are then chemically conjugated to the peptide. The fluorophores are preferably conjugated directly to the peptide however, they may also be coupled to the peptide through a linker. Finally, where the fluorogenic protease indicator is to be bound to a solid support, it is then chemically conjugated to the solid support via the spacer (S¹ or S²) either directly or through a linker.

Preparation of the Peptide Backbone

Solid phase peptide synthesis in which the C-terminal amino acid of the sequence is attached to an insoluble support followed by sequential addition of the remaining amino acids in the sequence is the preferred method for preparing the peptide backbone of the compounds of the present invention. Techniques for solid-phase synthesis are described by Barany and Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Part a., Merrifield, et al. J. Am. Chem. Soc. 85, 2149-2156 (1963), and Gross and Meienhofer, eds. Academic press, N.Y., 1980 and Stewart et al., Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, Ill. (1984) which are incorporated herein by reference. Solid phase synthesis is most easily accomplished with commercially available peptide synthesizers utilizing FMOC or TBOC chemistry. The chemical synthesis of the peptide component of a fluorogenic protease indicator is described in detail in Examples 1 and 2.

In a particularly preferred embodiment, peptide synthesis is performed using Fmoc synthesis chemistry. The side chains of Asp, Ser, Thr and Tyr are preferably protected using t-Butyl and the side chain of Cys residue using S-trityl and S-t-butylthio, and Lys residues are preferably protected using t-Boc, Fmoc and 4-methyltrityl for lysine residues. Appropriately protected amino acid reagents are commercially available. The use of multiple protecting groups allows selective deblocking and coupling of a fluorophore to any particular desired side chain. Thus, for example, t-Boc deprotection is accomplished using TFA in dichloromethane, Fmoc deprotection is accomplished using 20% (v/v) piperidine in DMF or N-methylpyrolidone, and 4-methyltrityl deprotection is accomplished using 1 to 5% (v/v) TFA in water or 1% TFA and 5% triisopropylsilane in DCM, S-t-butylthio deprotection is accomplished in aqueous mercaptoethanol (10%), t-butyl and t-boc and S-trityl deprotection is accomplished using TFA:phenol:water:thioanisol:ethanedithiol (85:5:5:2.5:2.5), and t-butyl and t-Boc deprotection is accomplished using TFA:phenol:water (95:5:5). Detailed synthesis, deprotection and fluorophore coupling protocols are provided in Examples 1 and 2.

Alternatively, the peptide components of the fluorogenic protease indicators of the present invention may be synthesized utilizing recombinant DNA technology. Briefly, a DNA molecule encoding the desired amino acid sequence is synthesized chemically by a variety of methods known to those of skill in the art including the solid phase phosphoramidite method described by Beaucage and Carruthers, Tetra. Letts. 22: 1859-1862 (1981), the triester method according to Matteucci, et al., J. Am. Chem. Soc., 103:3185 (1981), both incorporated herein by reference, or by other methods known to those of skill in the art. It is preferred that the DNA be synthesized using standard β-cyanoethyl phosphoramidites on a commercially available DNA synthesizer using standard protocols.

The oligonucleotides may be purified, if necessary, by techniques well known to those of skill in the art. Typical purification methods include, but are not limited to gel electrophoresis, anion exchange chromatography (e.g. Mono-Q column, Pharmacia-LKB, Piscataway, N.J., USA), or reverse phase high performance liquid chromatography (HPLC). Method of protein and peptide purification are well known to those of skill in the art. For a review of standard techniques see, Methods in Enzymology Volume 182: Guide to Protein Purification, M. Deutscher, ed. (1990), pages 619-626, which are incorporated herein by reference.

The oligonucleotides may be converted into double stranded DNA either by annealing with a complementary oligonucleotide or by polymerization with a DNA polymerase. The DNA may then be inserted into a vector under the control of a promoter and used to transform a host cell so that the cell expresses the encoded peptide sequence. Methods of cloning and expression of peptides are well known to those of skill in the art. See, for example, Sambrook, et al., Molecular Cloning: a Laboratory Manual (2nd Ed., Vols. 1-3, Cold Spring Harbor Laboratory (1989)), Methods in Enzymology, Vol. 152: Guide to Molecular Cloning Techniques (Berger and Kimmel (eds.), San Diego: Academic Press, Inc. (1987)), or Current Protocols in Molecular Biology, (Ausubel, et al. (eds.), Greene Publishing and Wilcy-Interscience, New York (1987), which are incorporated herein by reference.

Linkage of the FluoroDhores to the Peptide Backbone

The fluorophores are linked to the peptide backbone by any of a number of means well known to those of skill in the art. In a preferred embodiment, the fluorophore is linked directly from a reactive site on the fluorophore to a reactive group on the peptide such as a terminal amino or carboxyl group, or to a reactive group on an amino acid side chain such as a sulfur, an amino, a hydroxyl, or a carboxyl moiety. Many fluorophores normally contain suitable reactive sites. Alternatively, the fluorophores may be derivatized to provide reactive sites for linkage to another molecule. Fluorophores derivatized with functional groups for coupling to a second molecule are commercially available from a variety of manufacturers. The derivatization may be by a simple substitution of a group on the fluorophore itself, or may be by conjugation to a linker. Various linkers are well known to those of skill in the art and are discussed below.

As indicated above, in a preferred embodiment, the fluorophores are directly linked to the peptide backbone of the protease indicator. Thus, for example, the 5′-carboxytetramethylrhodamine (5-TMR) fluorophore may be linked to aspartic acid via the alpha amino group of the amino acid as shown in Formula V. The iodoacetamide group of rhodamine X acetamide (R492)) may be linked by reaction with the sulfhydryl group of a cysteine as indicated in formula VI. Means of performing such couplings are well known to those of skill in the art, and the details of one such coupling are provided in Example 1.

One of skill in the art will appreciate that when the peptide spacers (S¹ or S²) are present (as is discussed below), the fluorophores are preferably linked to the conformation determining regions through a reactive group on the side chain of the terminal amino acid of C¹ or C² as the spacers themselves form a peptide linkage with the terminal amino and carboxyl groups of C¹ or C² respectively.

Selection of Spacer Peptides and Linkage to a Solid Support

The fluorogenic protease indicators of the present invention may be obtained in solution or linked to a solid support. a “solid support” refers to any solid material that does not dissolve in or react with any of the components present in the solutions utilized for assaying for protease activity using the fluorogenic protease indicator molecules of the present invention and that provides a functional group for attachment of the fluorogenic molecule. Solid support materials are well known to those of skill in the art and include, but are not limited to silica, controlled pore glass (CPG), polystyrene, polystyrene/latex, carboxyl modified teflon, dextran, derivatized polysaccharides such as agar bearing amino, carboxyl or sulfhydryl groups, various plastics such as polyethylene, acrylic, and the like. Also of use are “semi-solid” supports such as lipid membranes as found in cells and in liposomes. One of skill will appreciate that the solid supports may be derivatized with functional groups (e.g. hydroxyls, amines, carboxyls, esters, and sulfhydryls) to provide reactive sites for the attachment of linkers or the direct attachment of the peptide.

The fluorogenic protease indicators may be linked to a solid support directly through the fluorophores or through the peptide backbone comprising the indicator. Linkage through the peptide backbone is most preferred.

When it is desired to link the indicator to a solid support through the peptide backbone, the peptide backbone may comprise an additional peptide spacer (designated S¹ or S² in Formula I). The spacer may be present at either the amino or carboxyl terminus of the peptide backbone and may vary from about 1 to about 50 amino acids, more preferably from 1 to about 20 and most preferably from 1 to about 10 amino acids in length. Particularly preferred spacers include Asp-Gly-Ser-Gly-Gly-Gly-Glu-Asp-Glu-Lys (SEQ ID NO:161), Lys-Glu-Asp-Gly-Gly-Asp-Lys (SEQ ID NO:162), Asp-Gly-Ser-Gly-Glu-Asp-Glu-Lys (SEQ ID NO:163), and Lys-Glu-Asp-Glu-Gly-Ser-Gly-Asp-Lys (SEQ ID NO:164).

The amino acid composition of the peptide spacer is not critical as the spacer just serves to separate the active components of the molecule from the substrate thereby preventing undesired interactions. However, the amino acid composition of the spacer may be selected to provide amino acids (e.g. a cysteine or a lysine) having side chains to which a linker or the solid support itself, is easily coupled. Alternatively the linker or the solid support itself may be attached to the amino terminus of S¹ or the carboxyl terminus of S².

In a preferred embodiment, the peptide spacer is actually joined to the solid support by a linker. The term “linker”, as used herein, refers to a molecule that may be used to link a peptide to another molecule, (e.g. a solid support, fluorophore, etc.). a linker is a hetero or homobifunctional molecule that provides a first reactive site capable of forming a covalent linkage with the peptide and a second reactive site capable of forming a covalent linkage with a reactive group on the solid support. The covalent linkage with the peptide (spacer) may be via either the terminal carboxyl or amino groups or with reactive groups on the amino acid side-chain (e.g. through a disulfide linkage to a cysteine).

Suitable linkers are well known to those of skill in the art and include, but are not limited to, straight or branched-chain carbon linkers, heterocyclic carbon linkers, or peptide linkers. As indicated above, the linkers may be joined to the carboxyl and amino terminal amino acids through their terminal carboxyl or amino groups or through their reactive side-chain groups.

Particularly preferred linkers are capable of forming covalent bonds to amino groups, carboxyl groups, or sulfhydryl. Amino-binding linkers include reactive groups such as carboxyl groups, isocyanates, isothiocyanates, esters, haloalkyls, and the like. Carboxyl-binding linkers are capable of forming include reactive groups such as various amines, hydroxyls and the like. Finally, sulfhydryl-binding linkers include reactive groups such as sulfhydryl groups, acrylates, isothiocyanates, isocyanates and the like. Particularly preferred linkers include sulfoMBS (m-maleimidobenzoyl-N-hydroxysulfosuccinimide ester) for linking amino groups (e.g. an amino group found on a lysine residue in the peptide) with sulfhydryl groups found on the solid support, or vice versa, for linking sulfhydryl groups (e.g. found on a cysteine residue of the peptide) with amino groups found on the solid support. Other particularly preferred linkers include EDC (1-ethyl-3-(3-dimethylaminopropryl)-carbodiimide) and bis-(sulfosuccinimidyl suberate). Other suitable linkers are well known to those of skill in the art.

The fluorogenic compounds of the present invention may be linked to the solid support through either the S¹ or the S² spacer such that the donor fluorophore is either retained on the solid support after cleavage of the molecule by a protease or such that the donor fluorophore goes into solution after cleavage. In the former case, the substrate is then assayed for fluorescence to detect protease activity, while in the later case the solution is assayed for fluorescence to detect protease activity.

Detection of Protease Activity

The present invention also provides methods for utilizing the fluorogenic protease indicators to detect protease activity in a variety of contexts. Thus, in one embodiment, the present invention provides for a method of using the fluorogenic indicators to verify or quantify the protease activity of a stock solution of a protease used for experimental or industrial purposes. Verification of protease activity of stock protease solutions before use is generally recommended as proteases often to loose activity over time (e.g. through self-hydrolysis) or to show varying degrees of activation when activated from zymogen precursors.

Assaying for protease activity of a stock solution simply requires adding a quantity of the stock solution to a fluorogenic protease indicator of the present invention and measuring the subsequent increase in fluorescence or decrease in exciton band in the absorption spectrum. The stock solution and the fluorogenic indicator may also be combined and assayed in a “digestion buffer” that optimizes activity of the protease. Buffers suitable for assaying protease activity are well known to those of skill in the art. In general, a buffer will be selected whose pH corresponds to the pH optimum of the particular protease. For example, a buffer particularly suitable for assaying elastase activity consists of 50 mM sodium phosphate, 1 mM EDTA at pH 8.9. The measurement is most easily made in a fluorometer, and instrument that provides an “excitation” light source for the fluorophore and then measures the light subsequently emitted at a particular wavelength. Comparison with a control indicator solution lacking the protease provides a measure of the protease activity. The activity level may be precisely quantified by generating a standard curve for the protease/indicator combination in which the rate of change in fluorescence produced by protease solutions of known activity is determined.

While detection of the fluorogenic compounds is preferably accomplished using a fluorometer, detection may by a variety of other methods well known to those of skill in the art. Thus for example, since the fluorophores of the present invention emit in the visible wavelengths, detection may be simply by visual inspection of fluorescence in response to excitation by a light source. Detection may also be by means of an image analysis system utilizing a video camera interfaced to an digitizer or other image acquisition system. Detection may also be by visualization through a filter as under a fluorescence microscope. The microscope may just provide a signal that is visualized by the operator. However the signal may be recorded on photographic film or using a video analysis system. The signal may also simply be quantified in real-time using either an image analysis system or simply a photometer.

Thus, for example, a basic assay for protease activity of a sample will involve suspending or dissolving the sample in a buffer (at the pH optima of the particular protease being assayed), adding to the buffer one of the fluorogenic protease indicators of the present invention, and monitoring the resulting change in fluorescence using a spectrofluorometer. The spectrofluorometer will be set to excite the donor fluorophore at the excitation wavelength of the donor fluorophore and to detect the resulting fluorescence at the emission wavelength of the donor fluorophore.

In another embodiment, the protease activity indicators of the present invention may be utilized for detection of protease activity in biological samples. Thus, in a preferred embodiment, this invention provides for methods of detecting protease activity in isolated biological samples such as sputum, blood, blood cells, tumor biopsies, and the like, or in situ, in cells or tissues in culture, or in section where the section is unimbedded and unfixed. The signal may be quantified using a fluorescence microscope, a fluorescence microplate reader, a fluorometer, or a flow cytometer.

Ex Vivo Assays of Isolated Biological Samples

In one embodiment, the present invention provides for methods of detecting protease activity in an isolated biological sample. This may be determined by simply contacting the sample with a fluorogenic protease indicator of the present invention and monitoring the change in fluorescence of the indicator over time. The sample may be suspended in a “digestion buffer” as described above. The sample may also be cleared of cellular debris, e.g. by centrifugation before analysis.

Where the fluorogenic protease indicator is bound to a solid support the assay may involve contacting the solid support bearing the indicator to the sample solution. Where the indicator is joined to the solid support by the side of the molecule bearing the donor fluorophore, the fluorescence of the support resulting from digestion of the indicator will then be monitored over time by any of the means described above. Conversely, where the acceptor molecule fluorophore is bound to a solid support, the test solution may be passed over the solid support and then the resulting luminescence of the test solution (due to the cleaved fluorophore) is measured. This latter approach may be particularly suitable for high throughput automated assays.

In Situ Assays of Histological Sections.

In another embodiment, this invention provides for a method of detecting in situ protease activity in histological sections. This method of detecting protease activity in tissues offers significant advantages over prior art methods (e.g. specific stains, antibody labels, etc.) because, unlike simple labeling approaches, in situ assays using the protease indicators indicate actual activity rather than simple presence or absence of the protease. Proteases are often present in tissues in their inactive precursor (zymogen) forms which are capable of binding protease labels. Thus traditional labeling approaches provide no information regarding the physiological state, vis a vis protease activity, of the tissue.

The in situ assay method generally comprises providing a tissue section (preferably a frozen section), contacting the section with one of the fluorogenic protease indicators of the present invention, and visualizing the resulting fluorescence. Visualization is preferably accomplished utilizing a fluorescence microscope. The fluorescence microscope provides an “excitation” light source to induce fluorescence of the “donor” fluorophore. The microscope is typically equipped with filters to optimize detection of the resulting fluorescence. Thus, for example, for the fluorogenic protease indicators described in Example 1, a typical filter cube for a Nikon microscope would contain an excitation filter (λ=550 12 nm), a dichroic mirror (λ=580 nm) and an interference-emission filter (λ=580 10 nm). As indicated above, the microscope may be equipped with a camera, photometer, or image acquisition system.

The sections are preferably cut as frozen sections as fixation or embedding will destroy protease activity in the sample.

The fluorogenic indicator may be introduced to the sections in a number of ways. For example, the fluorogenic protease indicator may be provided in a buffer solution, as described above, which is applied to the tissue section. Alternatively, the fluorogenic protease indicator may be provided as a semi-solid medium such as a gel or agar which is spread over the tissue sample. The gel helps to hold moisture in the sample while providing a signal in response to protease activity. The fluorogenic protease indicator may also be provided conjugated to a polymer such as a plastic film which may be used in procedures similar to the development of Western Blots. The plastic film is placed over the tissue sample on the slide and the fluorescence resulting from cleaved indicator molecules is viewed in the sample tissue under a microscope.

Typically the tissue sample must be incubated for a period of time to allow the endogenous proteases to cleave the fluorogenic protease indicators. Incubation times will range from about 10 to 60 minutes at temperatures up to and including 37 C.

In Situ Assays of Cells in Culture and Cell Suspensions Derived from Tissues and Biopsy Samples.

In yet another embodiment, this invention provides for a method of detecting in situ protease activity of cells in culture or cell suspensions derived from tissues, biopsy samples, or biological fluids (e.g., saliva, blood, urine, lymph, plasma, etc.). The cultured cells are grown either on chamber slides or in suspension and then transferred to histology slides by cytocentrifugation. Similarly, the cell suspensions are prepared according to standard methods and transferred to histology slides. The slide is washed with phosphate buffered saline and coated with a semi-solid polymer or a solution containing the fluorogenic protease indicator. The slide is incubated at 37 C for the time necessary for the endogenous proteases to cleave the protease indicator. The slide is then examined under a fluorescence microscope equipped with the appropriate filters as described above.

Alternatively, the cells are incubated with the protease indications at 37 C, then washed with buffer and transferred to a glass capillary tube and examined under a fluorescence microscope. When a flow cytometer is used to quantitate the intracellular enzyme activity, the cells with the fluorogenic indicator is simply diluted with buffer after 37 C incubation and analyzed.

Other Indicator Compositions

As explained above, it was a discovery of this invention that fluorescent molecules covalently attached on opposite sides of a backbone (e.g., peptide cleavage site) can quench by self-interaction (e.g., through the formation of dimers). Thus, in one embodiment, indicator molecules can be made using a single fluorophore rather than a matched donor-acceptor pair. Also, as explained above, particularly preferred fluorophores are those that form H-type dimers (e.g.,carboxytetramethylrhodamine, carboxyrhodamine-X, diethylaminocoumarin and carbocyanine dyes).

The use of single species labeled indicators, however, is not restricted to peptide-based compositions. To the contrary, “homo-double labeled” indicator molecules can utilize a variety of backbones including, but not limited to nucleic acid backbones, oligosaccharide backbones, lipid backbones, and the like. Methods of coupling fluorophores to such backbones are well known to those of skill in the art. For example, conjugation methods for attaching fluorophores to amino acids, peptides, proteins, nucleic acids, oligonucleotides, sugars , polysaccharides, proteoglycans, lipids, glycolipids and lipopolysaccharides, are described by Hermanson, (1995) Bioconjugate Techniques, Academic Press New York, N.Y., Kay M. et al., (1995) Biochemistry, 34: 293-300, and by Stubbs, et al. (1996) Biochemistry 35: 937-947.

Nucleic Acid Indicators.

Homo-doubly labeled nucleic acid backbones provide effective indicators for nucleic acid hybridizations and/or endonuclease activity. In this embodiment, a nucleic acid backbone is labeled with a self-quenching (e.g., H-type dimer-forming) fluorophore at the 3′ and 5′ end (either through a direct attachment or indirectly through (e.g., a peptide) linker). The nucleic acid backbone is selected to include self-complementary regions and thereby form a hairpin or other self-hybridized conformation that brings the fluorophores into proximity so that self-quenching occurs. When the indicator (probe) thus formed is hybridized to a complementary target nucleic acid, the self-hybridization is eliminated, the fluorophores are separated and the fluorescence signal produced by the molecule increases. Alternatively, the fluorescently labeled nucleic acid backbone can be used to assay for nuclease activity (e.g., restriction endonuclease or ribozyme activity). When the nucleic acid backbone is cleaved by a nuclcase (e.g., by restriction endonuclease recognition of a target site in the backbone) the fluorophores are separated again increasing the fluorescence signal. Methods of selecting appropriate nucleic acid backbones are described by Tyagi and Kramer et al. (1996) Nature Biotechnology, 14: 303-308.

The homo-doubly labeled fluorescently DNA probes can be used for detection, localization, or quantification of target DNA sequences in a variety of contexts. Thus, for example, the nucleic acid indicators of this invention can be used for rapid detection of amplification products in nucleic acid amplification (e.g., PCR) reactions. Here the indicator is selected with a backbone complementary to a region of the amplification product. As amplification product is produced the indicator hybridizes to the product and the fluorescence signal activity of the PCR solution increases. The nucleic acid indicators can be used as hybridization or nuclease activity indicators in a variety of other contexts. For example, in in situ hybridization (e.g., FISH) mapping of genomic DNA sequences can be accomplished using fluorescent probes to target particular regions within chromosomes (see, e.g., Meyne(1993) Chromosome mapping byfluorescent in situ hybridization, pp 263-268 In: Methods in Nonradioactive Detection G. C. Howard, ed., Appleton & Lange, Norwalk, Conn.; Morrison (1992) Detection of energy transfer and fluorescence quenching, pp. 311-352 In: Nonisotopic DNA Probes Techniques L. J. Kricka, ed. Academic Press, New York; and Varani (1995) Annu. Rev. Biophys. Biomol. Struct. 24: 379-404).

In another embodiment, the self-quenching fluorophores can be used to assay two molecule interactions (e.g., protein-protein, protein-nucleic acid, ligand-receptor, etc.). In this embodiment, one fluorophore is attached to one molecule (e.g., a protein) while the second fluorophore is attached to a second molecule (e.g., a second nucleic acid or a nucleic acid binding protein). When the two molecules bind, the fluorophores are juxtaposed and quench each other (e.g., through the formation of H-type dimers). The use of donor-acceptor resonance energy transfer systems to measure two molecule interactions is described by Bannwarth et al., Helvetica Chimica Acta. (1991) 74: 1991-1999, Bannwarth et al. (1991), Helvetica Chimica Acta. 74: 2000-2007, and Bannwarth et al., European Patent Application No. 0439036A2.

Oligosaccharide Indicators.

Homo-doubly labeled oligosaccharide backbone indicators permit the detection of glycosidase activity and lecithin binding protein identification. The fluorophores can be conjugated directly to an oligosaccharide or glycopeptide backbone or attached indirectly through (e.g., peptide) linkers. The oligosaccharides and/or glycopeptides can be chemically synthesized, recombinantly expressed, or isolated from natural sources such as fetuin and other glycoproteins by proteolytic fragmentation of the parent glycoproteins.

As in the case for oligonucleotides, an oligosaccharide specific structure may be selected for detection of a specific glycosidase, an enzyme that hydrolyzes bonds between two sugar molecules.

When a specific oligosaccharide or lecithin is selected to look for its lecithin binding protein, then the increased fluorescence indicates the complexation events that disrupt the H-type dimer, either by separating two dyes or distorting the relative orientation of two dyes. These effects result in increased fluorescence from the homo-double labeled probe.

Lipid Indicators

When a lipid, glycolipid or lipopolysaccharide are labeled with a self-quenching (e.g., H-type dimer forming) fluorophore and added to liposomes or other lipid (e.g., biological) membranes, a decrease in fluorescence will indicate H-type dimer formation and the degree of such fluorescence intensity will be an indication of the amount of H-type dimer formation. Because of the relative fluidity of a lipid membrane, the self-quenching fluorophores are able to interact (e.g. approach to a spacing of about 6 to about 10 Å) a stable H-type dimer results. When a membrane active agent, for example, an agent that affects either membrane fluid dynamics or permeabilization to a test compound, is added, then the observed fluorescence intensity changes indicate the test compound's ability to modify membrane fluidity or permeabilization. Hence, such labeled lipids are useful in drug screening and in development of lipid-drug delivery vehicles.

Similarly, the lipid-based probes of this invention can be used to similarly investigate the degree of lipid/protein interaction.

Cellular Uptake of Polypeptides

It was also a discovery of this invention that attachment of a hydrophobic protecting group to a polypeptide enhances uptake of that polypeptide by a cell. The effect is most pronounced when the polypeptide also bears a fluorophore, more preferably two fluorophores (see, Example 9). In certain preferred embodiments, however, the fluorophore(s) may double as the hydrophobic group. Preferred hydrophobic groups include, but are not limited to Fmoc, 9-fluoreneacetyl group (Fa), 1-fluorenecarboxylic group, 9-florenecarboxylic group, and 9-fluorenone-1-carboxylic group, benzyloxycarbonyl, Xanthyl (Xan), Trityl (Trt), 4-methyltrityl (Mtt), 4-methoxytrityl (Mmt), 4-methoxy-2,3,6-trimethyl-benzenesulphonyl (Mtr), Mesitylene-2-sulphonyl (Mts), 4,4′-dimethoxybenzhydryl (Mbh),Tosyl (Tos), 2,2,5,7,8-pentamethyl chroman-6-sulphonyl (Pmc), 4-methylbenzyl (MeBzl), 4-methoxybenzyl (MeOBzl), Benzyloxy (BzlO), Benzyl (Bzl), Benzoyl (Bz), 3-nitro-2-pyridinesulphenyl (Npys), 1-(4,4-dimentyl-2,6-diaxocyclohexylidene)ethyl (Dde), 2,6-dichlorobenzyl (2,6-DiCl-Bzl), 2-chlorobenzyloxycarbonyl (2-Cl-Z), 2-bromobenzyloxycarbonyl (2-Br-Z), Benzyloxymethyl (Bom), t-butoxycarbonyl (Boc), cyclohexyloxy (cHxO),t-butoxyrnethyl (Bum), t-butoxy (tBuO), t-Butyl (tBu), Acetyl (Ac), and Trifluoroacetyl (TFA).

The hydrophobic group can be coupled to the subject (e.g. indicator or inhibitor) molecule at essentially any convenient position. In preferred embodiments, the hydrophobic group is coupled at a position such that it does not interfere with recognition/binding of the subject molecule by a cognate binding partner (e.g., a protease). In a particularly preferred embodiment, where the subject molecule is a polypeptide, the hydrophobic group is attached to a terminus. The hydrophobic group can be attached directly to the subject molecule or it can be coupled via a linker. Linkers suitable for coupling hydrophobic groups are well know to those of skill in the art.

This invention thus provides a method of delivering a molecule (e.g., a polypeptide, oligonucleotide, oligosaccharide, a lipid, etc.) into a cell. The method involves providing the molecule to be delivered (e.g., polypeptide) having attached at least two fluorophore molecules and a hydrophobic group, more preferably an Fmoc group and then contacting the cell with the molecule.

It will be appreciated that where the peptide, oligonucleotide, oligosaccharide, or lipid is to be delivered in vivo for diagnostic end point or for therapeutic purposes, fluorophores and a hydrophobic group having reduced or no toxicity are preferred.

Thus, in a preferred embodiment, the fluorophores are replaced with non-toxic molecules having little or no biological activity. Preferred molecules are fused ring compounds that act as a linker joining the two ends of the molecule that is to be delivered. Particularly preferred fused ring compounds approximate the spacing of the exciton dimer.

Most preferred fused ring compounds include, but are not limited to steroids. The relatively flat and hydrophobic fluorophores that are known for H-type dimer formation can be replaced with similarly hydrophobic and structurally rigid and/or flat fused rings found, for example, in steroid molecules. a steroid derivative, e.g., a smaller than full steroid molecule, two to three fused six member ring molecules can be cross linked via usual cross linkers to provide a size and an over all hydrophobicity comparable to the Fmoc and other hydrophobic groups described herein. Since safe metabolic pathways exist for larger molecule consisting of these smaller building blocks, the toxicity of such hybrid molecules is expected to be small. In a preferred embodiment, the hydrophobic molecules are in a size range of about 17 by 12 Angstroms. It will be appreciated that where the peptide is to be delivered in vivo fluorophores of reduced or no toxicity are preferred. Toxicities of numerous fluorophores are well known to those of skill in the art (see, e.g., Haugland, Handbook of Fluorescent Probes and Research Chemicals, 6th Ed., Molecular Probes, Eugene, Oreg. (1996). In addition, toxicity (e.g., LD₅₀) can be readily determined according to standard methods well known to those of skill in the art. In a most preferred embodiment, the fused ring compound is a fused steroid such as structures XI and XII illustrated in Latt et al.(1965) J. Am. Chem. Soc., 87: 995-1003, where —OR₁ and —OR₂ can serve as activated points of attachment for the ends of peptides, nucleic acids or other molecules it is desired to transport into the cell.

As indicated above, the cellular uptake of almost any molecule will be enhanced by the attachment of the hydrophobic group and fluorophore or steroid cross-linkers. Thus, suitable molecules include virtually any molecule it is desired to introduce into the cell. Particularly preferred molecules include, but are not limited to, polypeptides (e.g., the protease inhibitors of this invention) and nucleic acids (e.g. oligonucleotide HIV inhibitors (see, e.g., Jing (1997) Biochem., 36: 12498-12505), ribozymes, peptide nucleic acids, and the like).

Activity Detection Kits

The present invention also provides for kits for the detection of protease activity in samples. The kits comprise one or more containers containing the fluorogenic protease indicators of the present invention. The indicators may be provided in solution or bound to a solid support. Thus the kits may contain indicator solutions or indicator “dipsticks”, blotters, culture media, and the like. The kits may also contain indicator cartridges (where the fluorogenic indicator is bound to the solid support by the “acceptor” fluorophore side) for use in automated protease activity detectors.

The kits additionally may include an instruction manual that teaches the method and describes use of the components of the kit. In addition, the kits may also include other reagents, buffers, various concentrations of protease inhibitors, stock proteases (for generation of standard curves, etc), culture media, disposable cuvettes and the like to aid the detection of protease activity utilizing the fluorogenic protease indicators of the present invention.

It will be appreciated that kits may additional or alternatively comprise any of the other indicators described herein (e.g., nucleic acid based indicators, oligosaccharide indicators, lipid indicators, etc). In this instance the kit will facilitate detection of the particular activities/compounds/interactions for which the particular indicator backbone is a substrate or binding agent.

Protease Inhibitors

It was also a discovery of this invention that the protease indicators can also act as protease inhibitors. Protease inhibitors and protease substrates share several basic properties such as ability to bind to protease's catalytic substrate binding site, and form a relatively stable complex with a protease. Hence, many normal substrates or their fragments exhibit competitive substrate inhibition at higher concentrations. The inhibition is competitive since the inhibitor binds to the same substrate binding site of the protease whereby it competes with the native substrate in binding to the protease's catalytic domain.

This invention provides three novel approaches for protease inhibitor design. In the first approach, a normal substrate is redesigned such that it binds to protease well, but has a reduced (slow or non-existent) hydrolysis rate. The slow hydrolysis rate is achieved by introducing an altered (different) conformation and/or conformational flexibility into the protease recognition domain. After the (e.g., native) substrate binds to the protease's substrate binding site, the conformation of the peptide bond between P₁ and P₁′ is distorted into a transition conformation of a given protease's peptide bond hydrolysis reaction. If this peptide bond as well as adjacent peptide bonds are altered such that they are not distortable then the hydrolysis rate will be reduced as compared to a substrate whose cleavage site peptide bond is easily distorted into the desired transition conformation. This approach is illustrated in Example 16 which shows how one can vary the hydrolysis rate of a substrate without changing the protease recognition amino acid sequences.

In a second approach, the inhibitor is produced by replacing the critical P₁ or P₁′ residue which makes it difficult to distort the cleavage site peptide bond. Normally, the amino acid side chains of P₁ and P₁′ residues interact specifically with the side chains of the protease catalytic domain. These specific interaction facilitate coordination of the peptide bond distortion into a transition conformation of the hydrolysis reaction. Thus, for example, when the critical P₁ residue of aspartic acid residue in the CPP32 protease substrate is replaced with non-charged asparagine then normal interaction between the substrate and protease does not take place even though the modified substrate binds to the protease's substrate binding site. Again, this leads to a slower or zero hydrolysis rate. The example of this P₁ residue substitution effect in designing an inhibitor is illustrated by the properties of the DEVN peptide (see, e.g., FIG. 5 and Example 12). The biological conformation that he substrate DEVN is an inhibitor is demonstrated in Example 13. Additional evidence that the peptide DEVN does bind to protease is given in Example 15.

The P₁′ residue can be changed to introduce either charged amino acid side chains or a structurally rigid, e.g., proline, residue as illustrated in the Table 3, substrate sequences for Hepatitis C viral protease substrate NS3 NS5A/5B of DVVCCSMS (SEQ ID NO:165, normal substrate) and DVVCCPdMS (SEQ ID NO:166, inhibitor). The underlined residues are the P₁ residues.

In a third approach, the amide bond between P₁ and P₁′ residues of a substrate can be changed to a non-hydrolyzable chemical bond including, but not limited to an ether, tioether, methylene bond, or alkylene (C═C) or ether bond (C—O—C(═O)) keeping the same amino acid side chains for the P₁ and P₁′ residues. Also the amide bond can be substituted with a retroinverso bond or other pseudoamino acid bond such as CH₂—NH or C(═O)—S replacing the carbonyl, group with a CH₂ group.

EXAMPLES

The invention is illustrated by the following examples. These examples are offered by way of illustration, not by way of limitation.

Example 1 Synthesis of Fluorogenic Molecule for Detecting Protease Activity

Peptide syntheses and derivitizations were performed as described in PCT publication PCT/US98/03000 (WO 98/37226) which is incorporated herein by reference.

Example 2 The Fluorogenic Protease Indicators Provide a Strong Signal when Digested

In order to demonstrate that the fluorogenic protease indicators of this invention are easily digested by a protease, the degree of cleavage was determined by assaying for the appearance of indicator cleavage products in the presence of a protease.

Approximately 1 microgram of protease indicator, having the formula F¹-Asp-Ala-Ile-Pro-Nle-Ser-Ile-Pro-Cys-F² (SEQ ID NO:177) where F¹ is a donor fluorophore (5′-carboxytetramethylrhodamine (5-TMR)) linked to aspartic acid via the alpha amino group and F² is an acceptor fluorophore (rhodamine X acetamide (R492)) linked via the sulfhydryl group of the cysteine was dissolved in a buffer consisting of 50 mM sodium phosphate, 1 mM EDTA at pH 8.9. To this solution was added 1 unit of elastase. The solution was analyzed by HPLC before and about 30 minutes after the addition of elastase. The digestion was carried out at 37° C. The HPLC separated components were monitored at a wavelength of 550 nm which allowed detection of both the 5-TMR fluorophore the R492 fluorophore and at 580 nm which allowed detection of the R492 fluorophore.

The results are indicated in FIG. 1 which shows the HPLC profiles of the fluorogenic protease indicator solution before and after addition of the protease elastase. FIG. 1(a) shows the HPLC before addition of the elastase showing a single peak representing the intact fluorogenic protease inhibitor. After addition of the elastase (FIGS. 1(b) and 1(c)) there was no trace of the late eluting single peak (FIG. 1(a)) indicating complete digestion of the fluorogenic protease indicator. In addition, the two predominant peaks in FIG. 1(b) and 1(c) indicate that the digestion occurred primarily at a single site. There are a few smaller peaks indicating a low degree of digestion at other sites within the peptide sequence, however, the striking predominance of only two digestion peaks suggests that these secondary sites were not readily accessible to the elastase.

Changes in the emission spectrum of the fluorogenic protease indicator after the addition of an elastase protease was monitored using an SLM spectrofluorometer model 48000 with slit widths set at 4 nm on both the excitation and emission sides. All measurements were carried out at 37 C.

Spectra in FIG. 2 show emission of the fluorogenic protease indicator (a) before and (b) after addition of elastase, while the time dependent increase of the indicator's donor fluorophore emission intensity, after addition of elastase, is plotted in FIG. 3. The fluorogenic protease inhibitor showed more than a 10 fold increase in fluorescence at 589 nm after treatment with the elastase protease (FIG. 2(a) compared to FIG. 2(b)) with over a 5 fold increase in fluorescence occurring within the first 1000 seconds of exposure to the protease. The changes in intensity between treated and untreated indicators are, to some degree, a function of slit widths used, since they represent the signal integrated across the particular slit width. Thus, if wider slit widths were used (e.g. 8 or 16 nm slits) an even greater signal would be provided in response to digestion.

Example 3 The Fluorescence Signal was Due to Intramolecular Energy Dequenching

In order to show that the fluorescence increase observed after protease treatment was due to intramolecular energy dequenching, the signal produced by elastase digestion of the fluorogenic protease indicator was compared to the signal produced by elastase treatment of the same peptide backbone coupled to either F¹ (5-TMR) or to F² (R492). The change in fluorescence intensity of the donor fluorophore after addition of 1 unit of elastase to equal concentrations of the double-fluorophore molecule and the two single-fluorophore molecules.

The results are illustrated in FIG. 4. The double-fluorophore molecule showed nearly complete quenching initially, followed by a dramatic increase in fluorescence after addition of the elastase which reached a constant value approximately 30 minutes after addition of the elastase (FIG. 4(a)). In contrast, the two single-fluorophore molecules showed virtually no initial quenching and no significant change in fluorescence after addition of the elastase. In fact, the fluorescence level was comparable to the fluorescence level of the fully digested double-fluorophore indicator molecule (FIG. 4(b)).

These results indicate that the increase in fluorescence intensity of the fluorogenic protease indicator is due to interruption of the resonance energy transferred intramolecularly from the donor fluorophore to the acceptor fluorophore and not to interaction between the fluorophore and the peptide backbone. This is significant since it is known that upon binding to a large protein or hydrophobic peptide the fluorescence of many hydrophobic fluorophores is quenched.

Example 4

Without being bound to a particular theory, it is believed that the fluorogenic protease indicators of the present invention achieve a high degree of protease specificity due to their folded structure, more particularly due to their relative rigid U-shaped conformation. The fluorescence obtained from the molecule reflects the average separation of two fluorophores. Thus, it was predicted that if the protease indicators existed in a relatively unfolded or flexible state, conditions that tend to cause unfolding (denaturation) would have little or no effect on the fluorescence of the molecule in the absence of a protease. Conversely, if the molecule is relatively rigid, then denaturing conditions would be expected to increase the fluorescence signal as the average separation of the fluorophores would be expected to increase thereby decreasing the quenching effect.

Thus, the effect of denaturing conditions on the fluorescence of the fluorogenic protease indicator in the absence of a protease was determined. First the change of fluorescence of the indicator of Example 1, as a function of added chaotropic reagent concentration (2M or 8M urea) was measured. When the fluorogenic protease indicator was denatured with a chaotropic reagent the fluorescence intensity increased with time to a plateau as the molecule denatured (unfolded).

These data indicate that the fluorogenic protease indicator normally exists in a stable folded conformation created by the conformation determining regions, as was predicted by a model based on an energy minimization algorithm. The plateau fluorescence level represents residual quenching of the fluorophores still joined by the fully denatured peptide backbone. Digestion of the extended (denatured) peptide results in greater than a 2 fold increase in fluorescence as the fluorophores are able to move farther away from each other.

Example 5 Quenching and Release of a Peptide Doubly-labeled with One Fluorophore

It was a surprising discovery of this invention that the peptide backbones of this invention doubly labeled with one fluorophore still achieve fluorescence quenching thus suggesting quenching through another mechanism besides resonance energy transfer.

In order to assess the extent ground-state dimerization and collisional quenching contribute to the total observed quenching, the series of doubly-labeled peptides listed in Table 11 was synthesized.

In addition to comparing absorption spectra of the dyes alone with the NorFes peptides singly labeled with each dye, emission spectra taken before and after cleavage were compared to determine the percent of quenching and the existence of fluorescent signal quenching by means other than resonance energy transfer (RET).

Fluorophores were linked to the amino terminus via the α-amino group of aspartic acid residue (D) and to the ε-amino group of lysine (K). Labeling was accomplished by the displacement of a succinimidyl group linked to 6-TMR or DER. The structure of the peptide, called NorFES-KGY is:

Fluorophore1-AspAlaIl ProNleS rIleProlysGlyTyr

-   -   Fluorophore2 (SEQ ID NO: 181).

As determined from absorption spectroscopy, all doubly-labeled peptides, except fluorescein-NorFES-fluorescein, showed the existence of so called ground-state dimers. This was indicated by shift of absorption maxima to shorter wavelengths as well as a shape change of the absorption spectra as compared with the spectra for the enzyme digested doubly-labeled samples. Upon cleavage with elastase, the ground-state dimers were destroyed and the resulting spectra were the same as a solution containing equal concentrations of the respective singly labeled peptides.

Without being bound to a particular theory, it is believed that the ground-state dimer formation observed in the compounds designed and synthesized according to the present invention indicates that the U-shaped conformation of the peptide backbone brings the fluorophores into close spatial proximity thus allowing overlap of electron orbitals of the two fluoropbores resulting in reciprocal quenching through ground-state dimerization. It was a surprising discovery that the polypeptides of this invention allowed the formation of ground-state dimers at a significantly lower dye concentration than previously observed. For example, ground-state dimerization of free fluorescein dye in solution was only observed at concentrations higher than 0.74 M, ground-state dimerization of free Eosin dye in solution was only observed at concentrations higher than 2.8×10⁻² M (see, Forster and Konig (1957) Zeitschrifl fur Electrochemie, 61: 344), and ground-state dimerization of Rhodamine B dye in solution was only observed at concentrations higher than 6×10⁻⁴ M (see Arbeloa and Ojeda (1982) Chemical Physics Letters, 87: 556). In contrast, in the present invention, the effects are observed at 4.0×10⁻⁷ M or about a 1000 fold lower concentration than the reported values.

The observation of the ground-state dimer for the compounds synthesized according to the present invention predicted a significant level of fluorescent quenching for doubly-labeled peptide with the same fluorophore as those compounds listed in Table 11. In fact this prediction was confirmed; a comparison of 6-TMR-NorFES-KGY-DER with 6-TMR-NorFES-KGY-6-TMR, i.e., the hetero doubly-labeled with the homo doubly-labeled peptides, indicates the degree of quenching is slightly higher in the hetero- vs. the homo- (94 vs. 90%). The fluorescein derivative, however, exhibited only 55% quenching. The symbols I₀ and I_(c) for the percent fluorescent quenching (% Q) refer to the fluorescence intensity for the intact labeled peptide and the enzyme digested labeled peptide solution respectively.

TABLE 11 Cleavage rate (T_(1/2)) and percentage of quenching (% Q) of hetero- and homo- labeled peptides. T_(1/2) is the time in seconds after addition of a protease (e.g. elastase) at which the fluorescence signal is ½ maximum. The symbols I_(o) and I_(c) refer to the fluorescence intensity (I) for the intact labeled peptide and the enzyme digested labeled peptide solution respectively. Compound T_(1/2) % Q-(1 − (I_(o)/I_(c))) × 100 6-TMR-NorFes-DER 80 94 6-TMR-NorFes-6-TMR 44 90 6-TMR-NorFes-6-TMR 44 90 DER-NorFes-DER 152 90 F1-NorFes-F1 18 55 6-TMR-NorFes-DER 80 94 6-TMR-K-NorFes-DER 125 97 6-TMR-NorFes-6-TMR 44 90 6-TMR-K-NorFes-6-TMR 84 92

The substrate sequence could be extended by one amino acid residue and the fluorophore could be attached through the episilon amino group on the lysine residue's side chain without major perturbation to the amount of observed quenching. Specifically, this addition (peptides designated K-NorFES-KGY) resulted in a slight decrease in cleavability rate and a very slight increase in the percent quenching for both the hetero- and homo-doubly-labeled peptide (in the K-NorFES-KGY peptides, N-terminal labeling was via the epsilon-amino group of lysine rather than the α-amino terminus).

Rates of cleavage (T_(1/2)) of these substrates by elastase were also measured by recording the time after addition of the protease at which the signal was one-half maximum (see, Table 11). a comparison of three homo-doubly-labeled peptides, i.e., NorFES-KGY labeled with two molecules of 6-TMR, DER, and fluorescein (F1), shows the order of cleavability to be: F1-NorFES-KGY-F1>6-TMR-NorFES-KGY-6-TMR>DER-NorFES-KGY-DER.

Example 6 Use of Homo-doubly Labeled Protease Indicators

In order to demonstrate the efficacy of the protease indicators of this invention in vitro, cells of the epidermal carcinoma cell line, A431, were grown to incomplete confluence in a Permanox tissue culture chamber slide (Nunc, Inc., Naperville, Ill., USA) in Dulbecco's Minimal Essential Medium (DME) containing 5% fetal calf serum (FCS). After removal of the medium, 200 μl of a solution containing 20% ethanol was added to each chamber and incubation was carried out for two minutes. The ethanolic medium was then removed and the monolayers were washed twice with DME (minus the FCS).

A DME solution containing 6-TMR-NorFes-6-TMR at a concentration of 1×10-−7 M was then incubated with the monolayer for 10 minutes. The cells were then examined for fluorescence with a Nikon fluorescence microscope using a rhodamine filter cube. (An advantage of using peptides homo-doubly-labeled with a single fluorophore compared to those labeled with two different fluorophores (hetero-doubly-labeled) is that fluorescence microscopy using homo-doubly-labeled peptides only requires a cutoff filter (i.e., a filter that transmits all light above a defined wavelength) on the emission side of the dichroic mirror, whereas fluorescence microscopy using hetero-doubly-labeled peptides preferably uses an interference filter (i.e., a filter that transmits light in a defined wavelength range (x y nm)).

Each cell was clearly defined by a diffuse red fluorescence (produced by the protease indicator cleaved by elastase) filling its entire cytoplasm. For cells at the edge of a confluent island, the black borders of the islands were clearly distinct from the red fluorescence in the cytoplasm of the cells indicating that the fluorescence was not due to background fluorescence or to cleavage of the protease indicator by the medium.

Example 7

In addition, we have synthesized and derivatized (homodoubly-labeled) PAI-2, CS-1 (a 31 residue long peptide) and two DEVD-like peptides that did not allow the dye-dye dimer formation. The CS-1 peptide shows that in a significantly longer peptide the dye-dye dimer structure can be formed. Note this peptide contains four proline residues in the amino terminal side of the putative cleavage site Ile-Leu bond. There is one proline in the carboxyl domain also. The results from the CS-1 peptide support a potentially larger sequence between the two dyes (fluorophores). Two DEVD-like peptide's amino acid sequences that did not allow the formation of productive H-type dimers are F₁—. AsRGluValAspGlvIleAspProLys(F₁)GlyTyr and Pro AstGluValAsPGlyIleAspProLys(F₁)GlyTyr.

Example 8 Cellular Uptake of Substrates Examined by Flow Cytometric and Fluorescence Microscopic Analysis

The compounds listed in Table 12 were synthesized and assayed for cellular uptake. Cellular internalization of the substrates was tested using Jurkat cells (a human acute T cell leukemic line), HL-60 cells (a human promyelocytic leukemic line), human lymphocyte lines, A1.1 cells (a murine T-cell line), and murine primary thymocytes. Procedures used in determining substrate uptake by viable cells are provided in Example 6 (for the HPLC procedures), in Example 2 (for the fluorescence microscopic analysis), and in Example 3 (for the flow cytometric analysis). a summary of these analyses with respect to cellular uptake of substrates is presented in this example.

TABLE 12 Compounds assayed for cellular uptake. Abbreviations used in the following table are: F¹: carboxytetramethylrhodamine; Z: benzyloxy- carbonyl group; Fm: Fmoc group; K[F1]: F¹ is covalently attached through the epsilon amino group of lysine (K). Single letter amino acid residues are used in the sequences except for Nlu for norleucine, B for aminoisobutyric acid and J for epsilon amino caproic acid residue. H: HPLC, FM: Fluorescence microscopy, FC: flow cytometry. Cellular uptake/ Uptake Seq ID Structure magnitude checked by NO 1 Fm-Lys(F1) Asp Ala Ile Pro Yes/high FM 182 Nlu Ser Ile Pro Lys (F1) Gly Tyr 2 Lys (F1) Asp Ala Ile Pro Nlu Yes/weak FM 183 Ser Ile Pro Lys (F1) Gly Tyr 3 Fm-Asp Ala Ile Pro Nlu Ser Ile No/ FM 184 Pro Lys (F1)Gly Tyr 4 Fm-Lys (F1) Asp Aib Asp Glu Yes/high FM & FC 185 Val Asp Gly Ile Asp Pro Lys (F1) Gly Tyr 5 Lys (F1) Asp Aib Asp Glu Val Yes/weak FM 186 Asp Gly Ile Asp Pro Lys (F1) Gly Tyr 6 Fm-Lys (F1) Asp Aib Asp Glu Yes/high FM 187 Val Ile Nlu Gly Ile Asp Pro Lys (F1) Gly Tyr 7 Lys (F1) Asp Aib Asp Glu Val Yes/weak FM & H 188 Nlu Gly Ile Asp Pro Lys (F1) Gly Tyr 8 Fm-Lys(F1) Asp Aib Glu Val Yes/high FM & FC 189 AspGlyIleAspProLys(F1)GlyTyr 9 Lys (F1) Asp Tyr Aib Ala Asp Yes/weak FM 190 Gly Ile Asp Pro Lys (F1) Gly Tyr 10 Fm-Lys (F1) Asp Aib Gly Asp Yes/high H & FC 191 Glu Val Asp Gly Ile Asp Gly Pro Lys (F1) Gly Tyr 11 Fm-Lys(F1) Asp Aib Ahx Gly Yes/high FC 192 Asp Glu Val Asp Gly Ile Asp Gly Ahx Pro Lys(F1)Gly Tyr 12 Z-Lys (F1) Asp Aib Ahx Gly Yes/weak FM 193 Asp Glu Val Asp Gly Ile Asp Gly Ahx Pro Lys (F1) Gly Tyr 13 Fm-Lys (F1) Asp Tyr Aib Ala Yes/high FM 194 Asp Gly Ile Asp Pro Lys (F1) Gly Tyr 14 Lys (F1) Asp Aib Glu Val Asp Yes/weak FM 195 Gly Ile Asp Pro Lys (F1) Gly Tyr

The data listed in Table 12 indicate that: (1) the presence of two fluorophores alone is not optimum for cellular uptake as illustrated by structures 2, 5, 7, and 9; (2) addition of a 9-fluorenylmethoxycarbonyl (Fmoc) group at the alpha amino group plus attachment of only one fluorophore, does not result insignificant cellular uptake (e.g., compound 3); and (3) two fluorophores plus at least one Fmoc group allows efficient cellular uptake of the substrates (structures 1, 4, 6, 8, 10, 11, and 12).

Other experiments utilizing protease substrates of this invention labeled with two optical fluorophores and at least one additional hydrophobic group such as an Fmoc group fits this paradigm. Replacing this Fmoc group with the less hydrophobic and smaller benzyloxycarbonyl group resulted in lower levels of cellular uptake, but was significantly better than a compound without a hydrophobic group such as DEVD peptide compound structure 5.

These data indicate that Fmoc may be replaced with Benzyloxycarbonyl, Z, or other hydrophobic groups such as Xanthyl (Xan), Trityl (Trt), 4-methyltrityl (Mtt), 4-methoxytrityl (Mmt), 4-methoxy-2,3,6-trimethyl-benzenesulphonyl (Mtr), Mesitylene-2-sulphonyl (Mts), 4,4′-dimethoxybenzhydryl (Mbb),Tosyl (Tos), 2,2,5,7,8-pentamethyl chroman-6-sulphonyl (Pmc), 4-methylbenzyl (MeBzl), 4-methoxybenzyl (MeOBzl), Benzyloxy (BzlO), Benzyl (Bzl), Benzoyl (Bz), 3-nitro-2-pyridinesulphenyl (Npys), 1-(4,4-dimethyl-2,6-diaxocyclohexylidene)ethyl (Dde), 2,6-dichlorobenzyl (2,6-DiCl-Bzl), 2-chlorobenzyloxycarbonyl (2-Cl-Z), 2-bromobenzyloxycarbonyl (2-Br-Z), Benzyloxymethyl (Bom), t-butoxycarbonyl (Boc), cyclohexyloxy (cHxO),t-butoxymethyl (Bum), t-butoxy (tBuO), t-Butyl (tBu), Acetyl (Ac), and Trifluoroacetyl (TFA).

When the acid groups on compound 5, DEVD peptide, were esterified with ethanol, this modified peptide did not show any enhanced cellular uptake by viable cells. Hence the importance of the Fmoc group and the two fluorophores forming H-type dimers are illustrated by this negative example.

Example 9 Fluorescence Microscopic Analysis of Cells Incubated with Elastase or Apoptosis-related Protease Substrates

The elastase substrate, Fm-Lys(F1)AspAlaIleProNluSerIleProLys(F1)GlyTyr, (SEQ ID NO:196, where F1 was carboxytetramethylrhodamine, Fm was Fmoc, K(F1) was F1 covalently attached through the epsilon amino group of lysine (K), and Fm—K is the Fmoc group covalently attached at the alpha amino group of the amino terminal lysine residue) was used with HL-60 cells. Cells were incubated with various concentrations of elastase substrate ranging from 10 nM to 10 μM for 5 minutes to 60 minutes. Then the cells were diluted 5-fold with RPMI 1640 medium containing 5% serum or with phosphate buffered saline. The samples were centrifuged and washed once more with 1 ml of washing solution. After centrifugation and removal of the washing solution, cell pellets were loosened with about 25 ul of medium and these cells were transferred to a glass capillary. Capillary tubes were then placed on a glass microscope slide and examined under a fluorescence microscope using standard rhodamine filters.

For apoptosis-related protease activity determination, 10 μM concentration of the compounds listed in Example 8 (compound structures 2 through 13) were incubated with cells for 30 min. to 3 hours. The cells were then washed similarly twice. Using glass capillary tubes, the washed cells were transferred and examined under a fluorescence microscope.

Example 10 Flow Cytometric Analysis of Cells Incubated with Apoptosis-related Protease Substrates

The concentration of substrates used in flow cytometric analysis was 10 μM in RPMI1640 medium containing 4 to 10% fetal calf serum. Cell densities during incubation with the chosen substrates ranged from 50,000 cells per ml to 4,000,000 per ml. Incubation times were from 30 min. to 3 hours at 37 degree C. and incubation volumes were 50 μl to 2 ml. After incubation with substrate for 30 to 60 min, cell suspensions were diluted 10-fold with ice cold Hank's Buffered Saline Solution (HBSS) and then filtered through a nylon fabric sheet. This filtered cell suspension was then subjected to flow cytometric analysis using a 488 nm excitation source. Becton Dickenson, Inc.'s flow cytometer, FacSort, was used in the flow cytometric analysis. Typically, 10,000 to 30,000 events per sample were collected.

Control cells without substrate incubation and the sample with the greatest expected fluorescence signals were used to set the instrument detector parameters. For example after 15 minutes incubation of Jurkat cells with substrate compound #11 Fm-CGD2D: Fm-CGD2D: Lys(F1)AspAibAhxGlyAspgluValAspGlyIleAspGlyAhxProLys(F1)GlyTyr (SEQ ID NO:197, where F1 was carboxytetramethylrhodamine; Fm was Fmoc, K(F1) was F1 covalently attached through the epsilon amino group of lysine (K), Nlu was norleucine, B was aminoisobutyric acid, and J was epsilon-aminocaproic acid) an increase of about 10 channels indicating cellular uptake of the substrates was measured. Note substrate #11 was not completely quenched. Hence, a small amount of background fluorescence would be expected from the intact substrate. Signals from the cells that had been activated with 1 ug/ml of ant-Fas antibody, CH11 clone for 1 to 6 hours indicated an increase in peak channel number. As much as a ten-fold increase in fluorescence intensity was observed. When the cells were co-incubated with the CPP32 protease inhibitor ZVAD-fluoromethylketone at 50 μM along with an apoptosis inducing agent, e.g., anti-Fas antibody, this observed increase in fluorescence intensity was eliminated. This indicated that the signal from compound 11 was due to the CPP32 protease activity which was inhibitable by ZVAD-FMK. Hence, the observed fluorescence intensity in each cell as determined by flow cytometric analysis served as a direct measure of the intracellular CPP32 protease activity.

Example 11 Competitive Substrate Inhibitors Illustrated by their Effects on Cell Lysate Hydrolysis of Apoptosis-related Protease Substrates

The level of CPP32 protease activity in the 6 hr ant-Fas-stimulated Jurkat cell lysate was examined using the protease substrate, DEVD-AFC (where AFC is aminofluoromethyl coumarin) 50 μM substrate concentration at 37 C. The buffer used was 50mM HEPES, pH7.5, 10%w/y sucrose, 0.1% w/v CHAPS.) Fluorescence intensity changes were monitored with an SLM 48000 spectrofluorometer. The hydrolysis rate of DEVD-AFC was found to depend upon the concentration of DEVD, DEVN, and ICE substrates (compounds 5, 7, and 9 in Table 12) present in the reaction mixtures. As the concentrations of DEVD, DEVN, and ICE were raised to 25 μM, the rate of DEVD-AFC hydrolysis was decreased. Hence, DEVD, DEVN and ICE substrates do bind to the substrate binding site of target proteases such as CPP32 and act as competitive inhibitors since their hydrolysis rates are slower than that of DEVD-AFC substrate. It was surprising to find that the substrate control peptide with its P₁ residue mutated with a conservative uncharged residue Asn still retained the ability to bind to the protease substrate binding site and exhibit enzyme inhibition.

Example 12 Substrates Delay and Inhibit Apoptosis Stimuli in Whole Cells

Jurkat cells are normally grown in 10% fetal calf serum containing RPMI 1640, at 37° C. in a 5% CO₂ atmosphere. When the serum content was dropped to 4%, the Jurkat cell growth rate not only slowed down but also a significant number of cells died within 36 hours. The cell density used was about 400,000 cell per ml. After 36 hours, control wells contained about 50% dead cells (trypan blue-positive cells), whereas the wells containing 0.1 or 1.0 μM concentration of compound #11 (Table 12) “Fm-CGD2D” or Fm-Lys(F1)AspAibAhxGlyAspgluValAspGlyIleAspGlyAhxProLys(F1)GlyTyr (SEQ ID NO:198) showed only 10% or 8% nonviable cells. Hence, compound #11 which exhibits efficient cellular uptake slowed down apoptosis in these Jurkat cells where it acted as a CPP32 protease inhibitor or a CPP32 activating protease inhibitor.

Example 13 Isolation of Intact and Cleaved Substrate Fragments from Cells

Jurkat cells, which had been induced into apoptosis by the ant-Fas antibody (1 ug/ml for 2 hours at 37 degree C.) were incubated with 10 μM substrate compound #10 Fm-G2D2D. After one hour incubation with this substrate, the cells were washed with 4% serum containing RPMI 1640 medium (1 ml wash solution for every 100 μl of incubation medium). Cells were washed three times, then solubilized with cell lysis buffer containing Triton X-100. This cell lysate was then analyzed using a C₄ reverse phase chromatography column and a water/acetonitrile eluent system containing 0.075% trifluoroacetic acid throughout. Analysis showed the presence of intact substrate with two major new peaks that eluted earlier than the intact substrate. The two recovered major peaks showed rhodamine absorption spectra; hence, these correspond to two major substrate fragments that are generated upon protease cleavage of the substrate.

Example 14 Fluorescence Signal from DEVN Substrates when Mixed with Target Enzyme Containing Solution

DEVN (10 μM), a substrate control peptide, compound 7 of Table 12, was found to be resistant to protease digestion by an apoptosis-activated Jurkat cell lysate. Extensive digestion time did not result in any further increase in fluorescence intensity. HPLC reverse phase analysis of this reaction mixture confirmed the presence of a totally uncleaved substrate. Substitution of the P₁ residue, Asp, by a non-charged amino acid Asn resulted in converting a protease substrate into a protease non-substrate.

This control peptide exhibited competitive substrate inhibition in the experiment as described in Example 12. In addition, fluorescence intensity monitoring as a function of time after addition of cell lysate showed a significant increase in fluorescence intensity initially but after 15 minutes this initial intensity level stabilized. Recalling that there was no substrate cleavage by the proteases present in the cell lysate, the best explanation of this initial fluorescence intensity is due to the DEVN substrate binding to the protease and the substrate undergoing a conformational change. This conformational change involving the substrate=s backbone also affects the conformation of two covalently attached fluorescent dye molecules with respect to each other in terms of mean distance and relative orientation. The degree of fluorescence quenching of these two fluorophores in the substrate structure has been found to be sensitive to their distance and the specific orientation with respect to their dipoles. Hence, any conformational change that affects these two aspects of the fluorescence reporting molecules would be expected to affect the fluorescence quenching as well. Thus, conformational changes induced by a substrate binding to a protease=s substrate binding site is reflected in the observed initial fluorescence intensity changes, i.e., an increase in its fluorescence intensity. Since the substrate cannot be cleaved, the initial fluorescence intensity increase levels off. One can utilize this observed fluorescence intensity increase due to conformational change of the substrate rather than substrate cleavage as a new kind of readout such as degree of association between the substrate and its target binding molecule.

Example 15 Variation of Hydrolysis Rates Induced by Varying the Flexibility of the Protease Recognition Domain by Various Conformation Determining Domain (CDR) Amino Acid Sequences

Protease cleavage sites of physiologically relevant substrates for any give protease may be classified into two cases. One is the serine protease inhibitors such as neutrophil elastase inhibitor or alpha 1 antitrypsin where the elastase recognition sequence is rather rigidly held by the remaining inhibitor molecule. Upon cleavage by elastase, this protease reactive site and its newly formed terminal residues undergoes significant conformational change as evidenced by high resolution crystallographic structural analyses of reactive and cleaved inhibitor structures. In the second class of protease cleavage sites the cleavage sequences are in domains where conformations are not well defined or there is a significant amount of flexibility, as much as in free linear peptides. It is said that some degree of defined conformation or a less than maximally available backbone flexibility difference between two potential substrates results in a given protease's preference for one substrate over another.

Compounds 4 (Fm-DEVD), 10 (Fm-G2D2D), and 11 (Fm-CGD2D) illustrate how one can introduce varying amount of constrained conformational space or flexibility into a given substrate with the same protease recognition domain but with different conformation determining domains or regions (CDRs) while retaining the bent forming function of the CDRs. This example illustrates how one can vary relative rigidity or flexibility of the central protease recognition domain by changing the CDR=s conformational flexibility or rigidity.

The parent compound Fm-DEVD has the following composition: Fmoc-Lys(F1)AspAibAspGluValGlyIleAspProLys(F1)GlyTyr (SEQ ID NO:199). The bold face underlined letters are the protease recognition sequence consisting of 7 amino acid residues. Compound #10 contains two glycine extensions at both ends of this protease recognition sequence. The central protease recognition domain now is 8 residues long GlyAspluValAspGlyIleAsp(SEQ ID NO:200), since the glycine residue at the amino terminus is a part of native sequence. The two glycine residues which are inherently more flexible than other amino acids, e.g., alanine, provide less conformational constraint or, conversely, more flexibility than compound 4 (Table 12) and thereby permit greater flexion when combined with Aib or Pro residues. Additional insertion of amino caproic acid at both termini with five methylene groups in addition to the one present in glycine provides further relaxation of the constrained conformation and, thus, greater flexibility for the protease recognition domain, GluValAspGlyIleAsp (SEQ ID NO:200). This progression of flexibility resulted in an increased hydrolysis rate with the CPP32 protease since CPP32 recognizes a more flexible protease recognition domain than does elastase. Support for this statement is that the CPP32 protease cleavage site in the proform of its physiological substrate, poly(ADP-ribose) polymerase, PARP, is located between two well-folded domains. Hence, it is expected that such a protease cleavage site would not be rigidly held or its conformation would be expected to be less defined than the remaining molecule. Hence, in order to provide these structural features to the substrate, introduction of flexible residues such as glycine, epsilon amino caproic acid, beta alanine, and amino butyric acid would be expected to play important roles in regulating the backbone flexibility of the substrate=s central protease recognition domain. These additional preferred residues for the conformation determining domain are also expected to provide the needed bend-inducing influence.

The observed varied hydrolysis rates for these three substrates point to success in regulating the flexibilities of the protease recognition domains. This, is reflected in the observed difference in proteolysis rates while maintaining the appropriate orientation for the two fluorophores to interact with each other through space. The importance of this conformation determining domain in providing a means to regulate flexibility as well as allowing the amino and carboxyl termini to orient in space appropriately close is exemplified by these compounds (4, 10, and 11).

These examples provide a tetrapeptide and a pentapeptide comprising Lys-Asp-Aib-Gly (SEQ ID NO:201) or Lys-Asp-Aib-Ahx-Gly (SEQ ID NO:202) where Ahx is episilon amino caproic acid (i.e. NH₂—(CH₂)₅—COOH). The fluorophore is attached to episilon amino group of the lysine residue. The carboxyl terminal CDR domain is defined as a tripeptide Gly-Pro-Lys and a tetrapeptide Gly-Ahx-Pro-Lys (SEQ ID NO:203). The hydrolysis rate was increased by 3-fold between compounds 4 (Fm-DEVD: Fm-Lys(F1)AspAibAspGluValAspGlyIleAspProLys(F1)GlyTyr, (SEQ ID NO:204) and 10 (Fm-G2D2D: Fm-Lys(F1)AspAibAspGluValAspGlyIleAspProLys(F1)GlyTyr, SEQ ID NO:205).

As illustrated in FIG. 5, the hydrolysis rate was further increased by ca. 3-fold over the above glycine residue insertion with the amino caproic amino acid (Ahx) addition, compound 11 (Fm-CGD2D: Fm-Lys(F1)Asp Aib Ahx Gly Asp Glu Val Asp Gly Ile Asp Gly Ahx Pro Lys (F1) Gly Tyr, SEQ ID NO:206). Hence, overall at least a 9-fold increase in substrate hydrolysis rate was accomplished (compounds 4 and 11, Table 12).

Example 16 Structural Characteristics of Fluorophores which Form Intramolecular H-type Dimers in a Protease Substrate

The strongest correlations between H-dimer formation and structural elements of various potential fluorophores for use in the homo-doubly labeled fluorogenic compositions of this invention are in order: delocalized charge, symmetry, and transition dipole magnitude. Hydrophobicity was not observed to be a major determinant in this type of dimerization.

In the experiments described herein, a new class of profluorescent protease substrate was designed and synthesized. These new fluorogenic indicators have spectral properties that fit the exciton model; More specifically, spectra of these polypeptides which were doubly labeled with rhodamines showed a blue-shifted absorption peak and fluorescence quenching, both indicators of H-dimer formation.

For example, NoreFes, an undecapeptide which is cleaved by the serine protease elastase, was homodoubly labeled on opposite sides of its cleavage site with six fluorophores in order to identify structural elements of dyes which influence intramolecular H-type dimer formation. Absorption and fluorescence spectra of these six substrate obtained before and after enzymatic cleavage suggest that the presence of a delocalized charge followed by symmetry and then magnitude of the lowest energy electronic transition dipole are important factors in dimer formation. Surprisingly, there was no evidence that hydrophobic interactions were important in the fluorophores used in this study.

The six fluorophores used in this study were rhodamine-X, tetramethylrhodamine, fluorescein, diethylaminocoumarin, hydroxycoumarin and pyrene.

While the xanthene components of these two rhodamines (rhodamine-X, tetramethylrhodamine) have the same charge and symmetric structure, the distinguishing characteristics between them are a higher transition dipole magnitude and lower hydrophobicity of the tetramethylrhodamine. One notes that the spectrum of the intact tetramethylrhodamine-derivatized substrate shows a more prominent change than that of rhodamine-X when comparing the absorption spectra of the two doubly-labeled intact peptides with those from the respective cleaved solution.

As noted above, in contrast to the two rhodamine derivatives where a charge of +1 is localized over each of the xanthene structure, the three conjugated ring component of the fluorescein was uncharged at pH 9. The lack of any significant shape changes in the absorption spectra after separation of the dyes (fluorescein) by cleavage of the peptide suggests a role for charge in H-dimer formation. The less pronounced, but nevertheless finite quenching observed with this derivative points toward a diminished but finite degree of interaction between two fluorescein compared with interactions between either of the two rhodamines is consistent with previous studies of xanthene in solution where the association constant for dimer formation for fluorescein is four order of magnitude lower than that for rhodamines.

The influence of dye symmetry was next examined using two coumarins, i.e. dicthylaminocoumarin and hydroxycoumarin. This class of molecules contains no symmetrical elements. The diethylaminocoumarin bears a positive charge delocalized over its two conjugated rings, similar to the rhodamines and the hydroxycoumarine is neutral at pH 9, similar to fluorescein. The spectrum of diethylaminocoumarin-labeled NorFes exhibits a blue shift of 11 nm while that of hydroxycoumarin-labeled NorFes shows just a slight blue shoulder. The respective degree of quenching, 76% and 28% of the intact peptides relative to the cleaved solutions is consistent with the importance of delocalized charge. Comparing the less pronounced spectral changes of the diethylaminocoumarin-derivatized peptide with those of the xanthene gives support to the role of symmetry as an important element in H-dimer formation.

Finally, the role of hydrophobicity was studied using pyrene, a fluorophore with S2 symmetry containing only carbons and hydrogens. No spectral changes were observed in either the absorption or the fluorescence mode and the magnitude of the transition dipole is extremely small. These results provide evidence against a dominant role for hydrophobicity in H-dimer formation.

In summary, the strongest correlations between H-dimer formation and structural elements are in order: delocalized charge, symmetry, and transition dipole magnitude. Hydrophobicity was not observed to be a major determinant in this type of dimerization.

The above examples are provided to illustrate the invention but not to limit its scope. Other variants of the invention will be readily apparent to one of ordinary skill in the art and are encompassed by the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference for all purposes. 

1. A fluorogenic composition for the detection of the activity of a protease, said composition having the formula:

wherein, P has the sequence Leu-Glu-His-Asp-Gly-Ile-Asn (SEQ ID NO:212); F¹ and F² are fluorophores and F¹ is attached to the amino terminal amino acid and F² is attached to the carboxyl terminal amino acid; S¹ and S², when present, are peptide spacers ranging in length from 1 to about 50 amino acids and S¹, when present, is attached to the amino terminal amino acid and S², when present, is attached to the carboxyl terminal amino acid; i, j, k, l, m, n, o, p, q, and r are independently 0 or 1; aa¹ and aa¹⁰ are independently selected from the group consisting of lysine, ornithine and cysteine; aa² and aa³ are independently selected from the group consisting of Asp, Glu, Lys, Ornithine, Arg, Citulline, homocitrulline, Ser, Pro, homoserine, Aib, Thr, and Tyr; aa⁸ and aa⁹ are independently selected from the group consisting of Asp, Glu, Lys, Ornithine, Arg, Citulline, homocitrulline, Ser, homoserine, Thr, and Tyr; aa⁵, aa⁴, aa⁶, and aa⁷ are independently selected from the group consisting of proline, 3,4-dehydroproline, hydroxyproline, alpha aminoisobutyric acid and N-methyl alanine; X is selected from the group consisting of Gly, βAla, γAbu, Gly-Gly, Ahx, 8-aminocaprylic acid, βAla-Gly, βAla-βAla, γAbu-Gly, βAla-γAbu, Gly-Gly-Gly, γAbu-γAbu, Ahx-Gly, βAla-Gly-Gly, Ahx-βAla, βAla-βAla-Gly, Gly-Gly-Gly-Gly (SEQ ID NO:223), Ahx-γAbu, βAla-βAla-βAla, γAbu-βAla-Gly, γAbu-γAbu-Gly, Ahx-Ahx, γAbu-γAbu-βAla, and Ahx-Ahx-Gly, Y is selected from the group consisting of Gly, βAla, γAbu, Gly-Gly, Ahx, 8-aminocaprylic acid, Gly-βAla, βAla-βAla, Gly-γAbu, γAbu-βAla, Gly-Gly-Gly, γAbu-γAbu, Gly-Ahx, Gly-Gly-βAla, βAla-Ahx, Gly-βAla-βAla, Gly-Gly-Gly-Gly (SEQ ID NO:223), γAbu-Ahx, βAla-βAla-βAla, Gly-βAla-γAbu, Gly-γAbu-γAbu, Ahx-Ahx, βAla-γAbu-γAbu, and Gly-Ahx-Ahx; when i is 1, S¹ is joined to aa¹ by a peptide bond through a terminal alpha amino group of aa¹; and when r is 1, S² is joined to aa¹⁰ by a peptide bond through a terminal alpha carboxyl group.
 2. The composition of claim 1, wherein the carboxyl terminal amino acid in which the carboxylic acid group is replaced with an amide.
 3. The composition of claim 1, wherein r is zero; and aa¹⁰ has a C-terminal amide group or free carboxylic acid group.
 4. The composition of claim 1, having the amino acid sequence Lys-Asp-Pro-Ahx-Gly-Leu-Glu-His-Asp-Gly-Ile-Asn-Gly-Ahx-Pro-Lys-Gly-Tyr (SEQ ID NO:248).
 5. The composition of claim 1, wherein F¹ and F² are the same fluorophore.
 6. The composition of claim 5, wherein said F¹ and F² have an excitation wavelength in the range of 315 nm to 750 nm.
 7. The composition of claim 1, wherein the F¹ molecule is attached through either an α-amino group of the aa¹ amino acid or through a side chain amino group of the aa¹ amino acid, or through a sulfhydryl group of a side chain of the aa amino acid.
 8. The composition of claim 1, wherein the F² molecule is attached either through a side chain amino group of the aa¹⁰ amino acid, through a carboxyl group of the aa¹⁰ amino acid, or through a sulfhydryl group of a side chain of the aa¹⁰ amino acid.
 9. The composition of claim 1, wherein said fluorophore is selected from the group consisting of rhodamine X, 9-(2,5-dicarboxyphenyl)-3,6-bis(dimethylamino)xanthylium halide or other anion, 9-(2,6-dicarboxyphenyl)-3,6-bis(dimethylamino)xanthylium halide or other anion, 9-(2,5)-dicarboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthylium halide or other anion (Rh6G), 9-(2,6)-dicarboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthylium halide or other anion, 9-(2,5-dicarboxyphenyl)-3,6-bisamino-xanthylium halide or other anion (Rh110), 9-(2,6-dicarboxyphenyl)-3,6-bisamino-xanthylium halide or other anion (Rh110), 9-(2,5)-dicarboxyphenyl)-3-amino-6-hydroxy-xanthylium halide or other anion (Blue Rh), 9-(2,6)-dicarboxyphenyl)-3-amino-6-hydroxy-xanthylium halide or other anion (Blue Rh), carboxytetramethylrhodamine, carboxyrhodamine-X, diethylaminocoumarin, 9-(2,5-dicarboxyphenyl)-3,6-bis-(dimethylamino)xanthylium chloride (5-TMR), 9-(2,6-dicarboxyphenyl)-3,6-bis-(dimethylamino)xanthylium chloride (6-TMR), 9-(2-carboxyphenyl)-2,7-dimethyl-3,6-bis(ethylamino)xanthylium, 9-(2-carboxyphenyl)-3,6-bis(dimethylamino)xanthylium, 9-(2-carboxyphenyl)-xanthylium, and a carbocyanine dye.
 10. The composition of claim 1, wherein said composition bears a hydrophobic group.
 11. The composition of claim 4, wherein said composition bears a hydrophobic group.
 12. The composition of claim 11, wherein said hydrophobic group is selected from the group consisting of: Fmoc, 9-fluoreneacetyl group (Fa), 1-fluorenecarboxylic group, 9-florenecarboxylic group, and 9-fluorenone-1-carboxylic group, benzyloxycarbonyl, Xanthyl (Xan), Trityl (Trt), 4-methyltrityl (Mtt), 4-methoxytrityl (Mmt), 4-methoxy-2,3,6-trimethyl-benzenesulphonyl (Mtr), Mesitylene-2-sulphonyl (Mts), 4,4-dimethoxybenzhydryl (Mbh), Tosyl (Tos), 2,2,5,7,8-pentamethyl chroman-6-sulphonyl (Pmc), 4-methylbenzyl (MeBzl), 4-methoxybenzyl (MeOBzl), Benzyloxy (BzlO), Benzyl (Bzl), Benzoyl (Bz), 3-nitro-2-pyridinesulphenyl (Npys), 1-(4,4-dimethyl-2,6-dioxocyclohexylidene)ethyl (Dde), 2,6-dichlorobenzyl (2,6-DiCl-Bzl), 2-chlorobenzyloxycarbonyl (2-Cl-Z), 2-bromobenzyloxycarbonyl (2-Br-Z), Benzyloxymethyl (Bom), t-butoxycarbonyl (Boc), cyclohexyloxy (cHxO),t-butoxymethyl (Bum), t-butoxy (tBuO), t-Butyl (tBu), Acetyl (Ac), and Trifluoroacetyl (TFA).
 13. The composition of claim 12, wherein said hydrophobic group is Fmoc.
 14. The composition of claim 12, wherein said hydrophobic group is Fa.
 15. The composition of claim 12, wherein said hydrophobic group is attached to the amino terminus of the molecule. 